Gene TBFG_10370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10370 
Symbol 
ID5221034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp446235 
End bp447365 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content68% 
IMG OID640605111 
Producthypothetical protein 
Protein accessionYP_001286315 
Protein GI148821561 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4833] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones337 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones208 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTGG CAAACCGGGC AGCCAGCGCC GAAACCGCCG TCACGCAACG GCATCTGAGA 
CGGCTTTGGG CGTTGCCGGG CACCCAGTTG GCGGTGGTGG CTTGGCCGTC AACCCGGCGC
GACCGGTTGT TCGGCAGCTG GCACTACTGG TGGCAGGCAC ACCTGCTGGA TTGCCTGGTC
GACGCGCAGC TGCGCGACCC GCAGCCGCAG CGGCGCGCCC GGATCAACCG CCAGGTCCGC
TCGCACCGGG TCCGCAACAA TTTCTCGTGG CTCAACAGCT ATTACGACGA CATGGCGTGG
CTAGCGTTAG CGCTGGAACG TGCCGACCGG GTCGCCGGGG TACGACGCCG GCGCGCACTG
CCCAAGCTCA CCAACCAGTT CGTCGAAGCC TGGGTGCCCG AGGACGGCGG CGGCATCCCG
TGGCGCAAGC AGGACCAGTT CTTCAACGCC CCAGCCAACG GCCCGGCCGG GCTATTCCTG
GCCCGCTACC CAGACCAGTA CGGGAAAAGG CTCAAGCGCG CAGAACAGAT GGCCGACTGG
ATCGATCGCA CGCTGATCGA TCCGGAGACA CACCTGGTAT TCGACGGCAT CAAGGCCGGG
TCGTTGGTCC GCGCGCAGTA CACCTACTGC CAAGGGGTGG TGCTCGGGCT GGAAACCGAG
CTGGCGGTGC GCACCGGTCC GGCAGCCAGA GCGCGGCACT GCGCTCGCGT TCATCGCTTG
GTCGCGGCCG TCAACGAGCA CATGGCTCCA TTGGGTGTGT TACGGGGCGC CGGCGGCGGG
GACGGTGGCC TGTTCGCGGG GATCACCGCC CGATACCTCG CCTTGGTCGC CACCACGTTG
CCGGGCGACT CGGCCGACGA CGCCGCCGCC CGCGACACCG CCCGCGCGAT AGTGCTGGCT
AGCGCGCAAT CGGCGTGGGA TTACCGGCAA ACCGTGGACG GGTTGCCGGT GTTCGGGGCG
TTCTGGGATC GCGAAGCCGA GTTGCCCACC GCCGGCGGTG AGCAGGCGCG GTCCGTCCGA
GGAGCGGTGC ATAGCTCGGC GATTGCCGAG CGAGATCTGT CGGTGCAGCT ATCGGGTTGG
ATGCTGATGG AAGCCGCCCA CAGCGCCGCA GCGGTCAGCT CACTCGGGTA A
 
Protein sequence
MNLANRAASA ETAVTQRHLR RLWALPGTQL AVVAWPSTRR DRLFGSWHYW WQAHLLDCLV 
DAQLRDPQPQ RRARINRQVR SHRVRNNFSW LNSYYDDMAW LALALERADR VAGVRRRRAL
PKLTNQFVEA WVPEDGGGIP WRKQDQFFNA PANGPAGLFL ARYPDQYGKR LKRAEQMADW
IDRTLIDPET HLVFDGIKAG SLVRAQYTYC QGVVLGLETE LAVRTGPAAR ARHCARVHRL
VAAVNEHMAP LGVLRGAGGG DGGLFAGITA RYLALVATTL PGDSADDAAA RDTARAIVLA
SAQSAWDYRQ TVDGLPVFGA FWDREAELPT AGGEQARSVR GAVHSSAIAE RDLSVQLSGW
MLMEAAHSAA AVSSLG