Gene TBFG_10738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10738 
Symbol 
ID5221406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp819532 
End bp821403 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content67% 
IMG OID640605483 
Productprotease IV sppA 
Protein accessionYP_001286683 
Protein GI148821929 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00705] signal peptide peptidase SppA, 67K type
[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones226 
Plasmid unclonability p-value0.0110097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones205 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATTT TCGGGGGCTT TTGCGTCTGC TCGCGGGCCC TTGGCGGCCG GTGGGTACGC 
TGGGTGAATA TGGTTGCCTT TCTGCCTTCC ATTCCCGTTG TCGAGGACCT ACGCGCCCTG
GTCGGCCGGG TTGATACCGC CCGCCACCAC GGTGTACCCA ACGGCTGCGT GCTCGAATTC
AACCTGCGAT CGGTGCCGCC GGAGACGACG GGCTTCGACC CTCTTACGGT GCTCACCGGG
GGTGGGCGGC CGATGGCGCT GCGCGATGCG GTCGCCGCGA TCCACCGTGC CGCCGAGGAC
CCCCGGGTAG CCGGGCTGAT AGCCCGCGTG CAGCTTCCGC CCTCGCCGGC GGGGGCGGTT
CAGGAGCTGC GGGAGGCCAT CGCGGCCTTC AGTGCGGTCA AGCCGTCGCT GGCCTGGGCC
GAAACTTATC CGGGCACCCT GTCCTACTAT CTGGCTTCGG CGTTCGGTGA GGTCTGGATG
CAACCCTCGG GGAGTGTGGG GCTGGTCGGC TTCGCCACCA ACGCCACATT CCTGCGCGAC
GCCCTGCACA AGGCGGGCAT CGAGGCCCAG TTCGTCGCCC GGGGCGAATA CAAGTCGGCG
GCAAACCTTT TCACCGAGGA TGGCTTCACA GACGCCCACC GCGAAGCGGT CACGCGGATG
CTGGACAGTC TGCAGGACCA GGTGTGGCAG GCGGTCGCCA AGTCGCGCAA TATCGGCGTC
GATGCGCTTG ATGAGCTGGC TGACCGGGCT CCGCTATTGC GGGACGACGC CGTGACTTGC
GGTCTGATCG ACCGGATCGG ATTTCGCGAC CAAGCCTACG CCCGTATGGC GGAATTGGTT
GGTGTGGAAA AAGGTTCACC GGAATCCAGT GGCTCGCAAA CAAGCCCAGA CGAAAAGCCG
CCGCGGATGT ACCTGGCGCG CTACGCCAGT TCGGCCCGGC CACGGCTGAC GCCCCCCGTC
CCATCGATTC CTGGTCGCCG GTCCAAGCCG ACGATCGCGG TGGTGACCCT GGAAGGCCCG
ATCGTCAACG GTCGTGGTGG GCCCCAGTTT CTGCCGCTCG GTCCGTCGAG CGCCGGCGGT
GACACCATCG CGGCAGCGCT GCGGGAGGTG GCCGCCGACG ATTCGGTGTC GGCGATAGTG
CTGCGGGTCG ACAGTCCGGG GGGCTCGGTC ACCGCATCGG AGACTATCTG GCGTGAGGTG
GCCAGGGCCC GCGACCGTGG CAAACCGGTG GTGGCGTCGA TGGGTGCGGT CGCCGCCTCC
GGTGGCTATT ACGTGTCGAT GGGTGCCGAC GCCATCGTGG CCAACCCGGG CACCATCACC
GGGTCGATCG GTGTGATCAC CGGAAAGCTG GTGGTTCGGG ATCTCAAGGA CCGGTTGGGT
GTCGGGTCGG ATGCGGTGCG CACCAACGCT AATGCCGATG CCTGGTCGAT CGACGCACCC
TTCACCCCGG ACCAGCAGGC CCATCGCGAG GCGGAGGCGG ACTTGTTCTA CAGCGACTTC
GTGGAACGCG TCGCCGAGGG CCGCAAGATG ACTACCGACG CCGTGGACGT CGTTGCGCGA
GGCCGGGTCT GGACCGGTGC CGACGCTCTC GATCGCGGCC TGGTCGACGA ACTCGGCGGC
CTTCGAACCG CGGTGCGTCG CGCGAAGGTG CTAGCCGGAC TAGATGAGGA CACCGAGGTT
CGCATAGTCA GTTATCCGGG GTCGTCACTC TGGGACATGG TGCGACCGCG TCCGTCGTCA
CGACCGGCAG CGGCATCGCT GCCGGATGCT ATGGGTGCGC TGCTTGCCCG TTCGATCGTC
GGCATCGTCG AGCAGGTGGA ACAGACTCTC AGTGGTGCCA GCGTGTTGTG GCTGGGGGAG
TCGCGCCTCT AG
 
Protein sequence
MPIFGGFCVC SRALGGRWVR WVNMVAFLPS IPVVEDLRAL VGRVDTARHH GVPNGCVLEF 
NLRSVPPETT GFDPLTVLTG GGRPMALRDA VAAIHRAAED PRVAGLIARV QLPPSPAGAV
QELREAIAAF SAVKPSLAWA ETYPGTLSYY LASAFGEVWM QPSGSVGLVG FATNATFLRD
ALHKAGIEAQ FVARGEYKSA ANLFTEDGFT DAHREAVTRM LDSLQDQVWQ AVAKSRNIGV
DALDELADRA PLLRDDAVTC GLIDRIGFRD QAYARMAELV GVEKGSPESS GSQTSPDEKP
PRMYLARYAS SARPRLTPPV PSIPGRRSKP TIAVVTLEGP IVNGRGGPQF LPLGPSSAGG
DTIAAALREV AADDSVSAIV LRVDSPGGSV TASETIWREV ARARDRGKPV VASMGAVAAS
GGYYVSMGAD AIVANPGTIT GSIGVITGKL VVRDLKDRLG VGSDAVRTNA NADAWSIDAP
FTPDQQAHRE AEADLFYSDF VERVAEGRKM TTDAVDVVAR GRVWTGADAL DRGLVDELGG
LRTAVRRAKV LAGLDEDTEV RIVSYPGSSL WDMVRPRPSS RPAAASLPDA MGALLARSIV
GIVEQVEQTL SGASVLWLGE SRL