Gene TBFG_10126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10126 
Symbol 
ID5220789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp151464 
End bp152531 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content68% 
IMG OID640604866 
Productserine protease pepA 
Protein accessionYP_001286071 
Protein GI148821317 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones158 
Plasmid unclonability p-value9.16358e-29 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones206 
Fosmid unclonability p-value0.942677 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATT CGCGCCGCCG CTCACTCAGG TGGTCATGGT TGCTGAGCGT GCTGGCTGCC 
GTCGGGCTGG GCCTGGCCAC GGCGCCGGCC CAGGCGGCCC CGCCGGCCTT GTCGCAGGAC
CGGTTCGCCG ACTTCCCCGC GCTGCCCCTC GACCCGTCCG CGATGGTCGC CCAAGTGGGG
CCACAGGTGG TCAACATCAA CACCAAACTG GGCTACAACA ACGCCGTGGG CGCCGGGACC
GGCATCGTCA TCGATCCCAA CGGTGTCGTG CTGACCAACA ACCACGTGAT CGCGGGCGCC
ACCGACATCA ATGCGTTCAG CGTCGGCTCC GGCCAAACCT ACGGCGTCGA TGTGGTCGGG
TATGACCGCA CCCAGGATGT CGCGGTGCTG CAGCTGCGCG GTGCCGGTGG CCTGCCGTCG
GCGGCGATCG GTGGCGGCGT CGCGGTTGGT GAGCCCGTCG TCGCGATGGG CAACAGCGGT
GGGCAGGGCG GAACGCCCCG TGCGGTGCCT GGCAGGGTGG TCGCGCTCGG CCAAACCGTG
CAGGCGTCGG ATTCGCTGAC CGGTGCCGAA GAGACATTGA ACGGGTTGAT CCAGTTCGAT
GCCGCGATCC AGCCCGGTGA TTCGGGCGGG CCCGTCGTCA ACGGCCTAGG ACAGGTGGTC
GGTATGAACA CGGCCGCGTC CGATAACTTC CAGCTGTCCC AGGGTGGGCA GGGATTCGCC
ATTCCGATCG GGCAGGCGAT GGCGATCGCG GGCCAGATCC GATCGGGTGG GGGGTCACCC
ACCGTTCATA TCGGGCCTAC CGCCTTCCTC GGCTTGGGTG TTGTCGACAA CAACGGCAAC
GGCGCACGAG TCCAACGCGT GGTCGGGAGC GCTCCGGCGG CAAGTCTCGG CATCTCCACC
GGCGACGTGA TCACCGCGGT CGACGGCGCT CCGATCAACT CGGCCACCGC GATGGCGGAC
GCGCTTAACG GGCATCATCC CGGTGACGTC ATCTCGGTGA CCTGGCAAAC CAAGTCGGGC
GGCACGCGTA CAGGGAACGT GACATTGGCC GAGGGACCCC CGGCCTGA
 
Protein sequence
MSNSRRRSLR WSWLLSVLAA VGLGLATAPA QAAPPALSQD RFADFPALPL DPSAMVAQVG 
PQVVNINTKL GYNNAVGAGT GIVIDPNGVV LTNNHVIAGA TDINAFSVGS GQTYGVDVVG
YDRTQDVAVL QLRGAGGLPS AAIGGGVAVG EPVVAMGNSG GQGGTPRAVP GRVVALGQTV
QASDSLTGAE ETLNGLIQFD AAIQPGDSGG PVVNGLGQVV GMNTAASDNF QLSQGGQGFA
IPIGQAMAIA GQIRSGGGSP TVHIGPTAFL GLGVVDNNGN GARVQRVVGS APAASLGIST
GDVITAVDGA PINSATAMAD ALNGHHPGDV ISVTWQTKSG GTRTGNVTLA EGPPA