Gene Francci3_3747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3747 
Symbol 
ID3906031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4492272 
End bp4493306 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content74% 
IMG OID637881073 
Productthiamine monophosphate kinase 
Protein accessionYP_482827 
Protein GI86742427 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.635061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGTC ACGGGAGACC CCCGGCGGGG GCGACGCTGC GCGATCTGGG CGAGTTCGGC 
TTGATCGAGG CCGTGACGGC GCGTTTCTCC CATGGACGGG ACGTGCTGGT CGGCCCCGGG
GACGACGCGG CGGTGCTGGC CGCCCCGGAC GGTCGGGTGG TCGTCACCAC CGACCTGCTG
CTCGAAGGAC GCCACTTCCG CCGGGACTGG TCGTCCGCGG TGGATGTCGG GCACAAGGCC
GCCGCGCAGA ACCTCGCCGA CGTGGCGTCG ATGGGGGCGA GGCCCACCGG CCTGTTCCTC
GGCTTCGCCG CACCCGGCGA CCTCGCGGTC GACTGGGCCC TGGCGATGGC CGACGGAATG
GCCGAGGAAT GCGCGCAGGC CGGTGCCTCC GTGTCCGGCG GGGACGTCAG CTCCGCGGAT
TCGATCATGC TCGGGGTGAC CGCGCTCGGT GACCTGCAGG GCCGGCCGCC GGTCCTGCGC
TCGGGCGCCC GGCCCGGCGA TGCCGTGGTG CTCGCTGGTC GTCTCGGGTG GGCCCAGGCG
GGGCTGGCCC TACTCACGGC GGGTTCGGCA GCCGGCGACT CCCGATGGGT CGACGACCCG
AGATGGAAGG AACTGATCGC GGCGCACCGC CGGCCCCGGC CGCCCTACGC GCTCGGGCCG
GTGCTCGCCG CCGCGGGCGC GCACGCCATG TGCGACGTCT CGGACGGTCT CGTGGCGGAT
CTCGGTCACA TCGCCGTGGC CTCGGGGGTG TCCATCGATC TCGCGTCGGA AGCCTTCCTG
ATCGACGAGG CGATGCGGTC GGCCGGGGCC GCCCTTGATG TGGACCCGTT GCGCTGGGTA
CTCGGCGGCG GCGAGGATCA CCCGCTGGTC GCCTGCCTTC CCCCAGGAGT GACCCCGCCG
GCCGGGTGCA TCGTGATCGG TTCGGTCCTG AGCGGCGCGG GATCGGCCGG CGCGGGGGTG
ACAGTGGACG GGGAGCGCCC CGTAGGCAGC GATGGGTGGG ATCACTACGG GCAGTCCGCG
ACGGACGCAT CCTGA
 
Protein sequence
MGSHGRPPAG ATLRDLGEFG LIEAVTARFS HGRDVLVGPG DDAAVLAAPD GRVVVTTDLL 
LEGRHFRRDW SSAVDVGHKA AAQNLADVAS MGARPTGLFL GFAAPGDLAV DWALAMADGM
AEECAQAGAS VSGGDVSSAD SIMLGVTALG DLQGRPPVLR SGARPGDAVV LAGRLGWAQA
GLALLTAGSA AGDSRWVDDP RWKELIAAHR RPRPPYALGP VLAAAGAHAM CDVSDGLVAD
LGHIAVASGV SIDLASEAFL IDEAMRSAGA ALDVDPLRWV LGGGEDHPLV ACLPPGVTPP
AGCIVIGSVL SGAGSAGAGV TVDGERPVGS DGWDHYGQSA TDAS