Gene TBFG_10020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10020 
Symbol 
ID5220682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp23962 
End bp25545 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content66% 
IMG OID640604759 
Producthypothetical protein 
Protein accessionYP_001285965 
Protein GI148821211 
COG category[T] Signal transduction mechanisms 
COG ID[COG1716] FOG: FHA domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones321 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones217 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAGCC AGAAAAGGCT GGTTCAGCGC GTTGAGCGCA AACTCGAGCA GACGGTTGGC 
GATGCGTTTG CCCGCATCTT TGGAGGCTCG ATCGTCCCGC AAGAGGTCGA AGCCCTGCTG
CGCCGCGAGG CGGCCGACGG CATCCAGTCG CTGCAGGGAA ATCGCCTTTT GGCGCCCAAC
GAATACATCA TTACCCTCGG TGTGCACGAC TTTGAGAAGT TGGGCGCTGA TCCTGAGCTG
AAGTCAACCG GTTTTGCTCG GGACTTGGCG GACTATATCC AAGAACAGGG GTGGCAAACG
TATGGTGATG TGGTCGTCCG ATTCGAGCAG TCGTCGAACC TGCATACCGG CCAGTTCCGC
GCCCGCGGCA CTGTTAACCC CGACGTTGAG ACCCACCCGC CGGTCATCGA TTGCGCCCGG
CCACAATCAA ACCACGCGTT TGGCGCAGAA CCAGGAGTAG CACCAATGAG TGACAATTCG
AGCTACCGTG GCGGTCAGGG GCAGGGGCGT CCCGACGAGT ATTACGACGA CCGCTATGCG
CGTCCGCAAG AGGATCCGCG TGGTGGCCCG GATCCGCAAG GCGGATCTGA CCCCCGCGGG
GGGTATCCAC CCGAGACGGG CGGCTACCCG CCCCAGCCGG GCTACCCACG CCCGCGCCAC
CCGGACCAGG GCGACTACCC CGAGCAAATC GGGTACCCCG ACCAGGGCGG TTACCCCGAG
CAACGCGGTT ACCCCGAGCA ACGCGGCTAC CCCGACCAGC GCGGGTACCA GGACCAGGGT
CGAGGCTACC CCGACCAAGG GCAGGGGGGC TATCCGCCGC CCTACGAGCA ACGCCCTCCT
GTTTCTCCCG GCCCGGCTGC CGGCTACGGC GCTCCCGGCT ACGACCAGGG CTATCGCCAA
AGCGGCGGCT ACGGCCCTTC ACCCGGTGGC GGCCAGCCCG GCTACGGCGG GTACGGGGAG
TACGGGCGTG GCCCGGCTCG CCACGAGGAG GGCAGCTATG TGCCCTCTGG CCCTCCGGGC
CCGCCCGAGC AACGACCGGC TTACCCCGAC CAAGGCGGTT ACGACCAGGG CTACCAGCAA
GGCGCCACGA CATACGGCCG GCAAGACTAT GGCGGCGGCG CTGACTACAC CCGCTACACC
GAATCCCCGC GGGTCCCGGG ATACGCTCCT CAGGGTGGCG GGTACGCCGA ACCCGCCGGC
CGAGACTACG ACTACGGCCA ATCAGGCGCT CCGGACTACG GTCAGCCAGC GCCCGGTGGC
TACAGCGGTT ACGGGCAGGG CGGCTATGGG TCCGCCGGAA CGTCGGTTAC GCTGCAGCTC
GACGACGGCA GCGGACGCAC TTACCAGCTC CGCGAGGGCT CCAACATCAT CGGTCGCGGA
CAGGACGCCC AGTTCCGGCT GCCCGACACC GGTGTGTCAC GCCGTCACTT GGAGATCCGG
TGGGACGGGC AGGTCGCATT GCTCGCAGAC CTGAACTCCA CCAACGGCAC CACTGTTAAC
AATGCACCGG TACAGGAGTG GCAGTTGGCC GACGGTGATG TGATCCGCTT GGGACACTCC
GAGATCATCG TCCGCATGCA CTGA
 
Protein sequence
MGSQKRLVQR VERKLEQTVG DAFARIFGGS IVPQEVEALL RREAADGIQS LQGNRLLAPN 
EYIITLGVHD FEKLGADPEL KSTGFARDLA DYIQEQGWQT YGDVVVRFEQ SSNLHTGQFR
ARGTVNPDVE THPPVIDCAR PQSNHAFGAE PGVAPMSDNS SYRGGQGQGR PDEYYDDRYA
RPQEDPRGGP DPQGGSDPRG GYPPETGGYP PQPGYPRPRH PDQGDYPEQI GYPDQGGYPE
QRGYPEQRGY PDQRGYQDQG RGYPDQGQGG YPPPYEQRPP VSPGPAAGYG APGYDQGYRQ
SGGYGPSPGG GQPGYGGYGE YGRGPARHEE GSYVPSGPPG PPEQRPAYPD QGGYDQGYQQ
GATTYGRQDY GGGADYTRYT ESPRVPGYAP QGGGYAEPAG RDYDYGQSGA PDYGQPAPGG
YSGYGQGGYG SAGTSVTLQL DDGSGRTYQL REGSNIIGRG QDAQFRLPDT GVSRRHLEIR
WDGQVALLAD LNSTNGTTVN NAPVQEWQLA DGDVIRLGHS EIIVRMH