Gene TBFG_11837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_11837 
Symbol 
ID5222514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp2056638 
End bp2057849 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content69% 
IMG OID640606594 
ProductPPE family protein 
Protein accessionYP_001287774 
Protein GI148823020 
COG category[N] Cell motility 
COG ID[COG5651] PPE-repeat proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones361 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones232 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCG CACTTGACTT CGCCACGCTA CCGCCCGAAA TCAACTCGGC GCGTATGTAT 
TCCGGCGCGG GCTCGGCCCC GATGCTGGCC GCAGCGTCAG CCTGGCACGG CTTGTCCGCA
GAACTGCGCG CCAGCGCACT GTCATACAGC TCGGTGCTTT CGACGCTGAC CGGTGAAGAA
TGGCACGGTC CGGCGTCGGC ATCGATGACA GCCGCGGCCG CCCCCTACGT GGCCTGGATG
AGCGTCACCG CCGTCCGGGC CGAGCAGGCC GGGGCACAGG CGGAGGCTGC CGCTGCAGCG
TACGAAGCCG CGTTCGCAGC AACGGTGCCC CCGCCGGTCA TCGAGGCCAA CCGCGCCCAG
CTCATGGCGC TGATCGCCAC CAATGTGCTA GGCCAAAACG CCCCCGCGAT CGCGGCCACC
GAGGCCCAGT ACGCCGAAAT GTGGTCCCAG GACGCGATGG CCATGTACGG CTACGCCGGC
GCCTCGGCAG CCGCTACCCA GCTGACCCCG TTCACCGAGC CGGTGCAGAC TACCAACGCG
TCCGGCCTGG CGGCCCAGTC GGCTGCGATT GCCCACGCCA CCGGCGCCTC GGCTGGTGCT
CAGCAAACGA CGCTGTCGCA GCTGATCGCC GCCATACCGT CTGTACTGCA AGGACTTTCG
TCATCGACTG CAGCCACGTC CGCGTCGGGG CCGTCCGGAT TGCTGGGCAT TCTCGGGTCT
GGATCTTCCT GGCTCGACAA ACTCTGGGCG TTACTGGACC CCAACTCCAA TTTCTGGAAC
ACGATAGCTT CGTCCGGACT GTTCTTGCCG AGTAACACGA TTGCGCCCTT TTTGGGTCTA
CTCGGCGGCG TGGCAGCTGC GGATGCGGCC GGGGATGTGT TGGGAGAGGC CACCAGTGGC
GGGCTCGGTG GCGCGCTGGT GGCGCCGCTT GGCTCAGCGG GCGGGCTAGG CGGCACTGTC
GCGGCCGGCC TGGGCAACGC GGCCACCGTC GGAACCTTGT CGGTGCCGCC GAGCTGGACG
GCGGCCGCAC CACTAGCCAG CCCCTTGGGC TCCGCGTTGG GAGGCACACC GATGGTGGCA
CCGCCCCCAG CAGTGGCGGC CGGCATGCCC GGAATGCCTT TCGGCACCAT GGGCGGTCAA
GGCTTCGGGC GTGCCGTGCC CCAGTATGGC TTCCGCCCCA ACTTCGTCGC ACGACCGCCC
GCCGCCGGGT GA
 
Protein sequence
MTAALDFATL PPEINSARMY SGAGSAPMLA AASAWHGLSA ELRASALSYS SVLSTLTGEE 
WHGPASASMT AAAAPYVAWM SVTAVRAEQA GAQAEAAAAA YEAAFAATVP PPVIEANRAQ
LMALIATNVL GQNAPAIAAT EAQYAEMWSQ DAMAMYGYAG ASAAATQLTP FTEPVQTTNA
SGLAAQSAAI AHATGASAGA QQTTLSQLIA AIPSVLQGLS SSTAATSASG PSGLLGILGS
GSSWLDKLWA LLDPNSNFWN TIASSGLFLP SNTIAPFLGL LGGVAAADAA GDVLGEATSG
GLGGALVAPL GSAGGLGGTV AAGLGNAATV GTLSVPPSWT AAAPLASPLG SALGGTPMVA
PPPAVAAGMP GMPFGTMGGQ GFGRAVPQYG FRPNFVARPP AAG