Gene Tbd_0195 details


Gene Information

Locus tag: Tbd_0195
Symbol:
ID: 3673641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiobacillus denitrificans ATCC 25259
Kingdom: Bacteria
Replicon accession: NC_007404
Strand:
Start bp: 206569
End bp: 209364
Gene Length: 2796 bp
Protein Length: 931 aa
Translation table: 11
GC content: 67%
IMG OID: 637708856
Product: TPR repeat-containing protein
Protein accession: YP_313953
Protein GI: 74316213
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID: [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein
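
The start and end coordinates, gene length, and protein length in this table can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single untranslated stop codon at the end of the CDS; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 206569
end_bp = 209364

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 2796 931
```

Under these assumptions the result agrees with the listed Gene Length (2796 bp) and Protein Length (931 aa).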


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCGAC ATCCCGCTCC CCTGATTGCG TCGCTGCTGG TCCTGGCCTT TTGCGCGCCG 
CTCGCCGGCT GCGATCCCAC GGCCGGACTC AGCGCCCAGG AACACGTCCA GCGCGCCAAG
GACTTCGAGG ACAAGGGCGA CCTCAAGGGC AGCGTCATCG AACTGAAGAA CGCGATCCAG
AAGAATCCCG ACAGCGCCGA AGCCCGGCTC CTGCTGGGAC AGGTCTACCT CAAGGCGGGC
TTCGGCGCCG AAGCGGAAAA GGAACTTCGA CAAGCCGAGC GGCTCGGCGT CGGCCGCGCC
ACCCTCGAGC CCCTGCTCGG GGAAGCCCTG CTGCTGATGG GCGAGTACGC GCGCGTCCTC
GACGAAATCC AGCCGGACAC GCAGGGACCG AAGGAGCGCC TGTCGCGCAT CCTGCAACTG
CGTGGCGAGG CCCTGCTGAA CCAGCGGAAA CTGGAGGAGG CGTGCAATCT GTTCCAGCAG
TCCTACGATG CCTCGCCCGG CAACCCGCCC ACCTACTGGG GCCTTTCGCG CTGCGCGCTG
GCAACGGGTG ACGCGGCGAA GGCGCGCGAC TGGCTTGAAC GCGCGCTCAA GCTCGAGCAC
AAGCGCGCCC GCACCTGGAT TCACCTCGGC AACCTCGAAT TGGCCGGCAA GGATACGGCG
AAGGCGCTCG CTGCCTATTC GAAGGCTGTG AAGATCGAAC CGAACAATCT GGATGCGCTG
CAGAGTCTGG TCGCGATTCA CGTCAAGGCG GGAGACACCC AGCGCGCGCG CGAGTACTTG
GCCGTGATCA GGAAGCTCGC GCCCAAATCG ACCCGCGCAC ATTACCTCGA GGCGTCGATC
GCCTACAGCG AGAAGAAATT CGCCGAGGCA AACGCCGCGA TTCAGGAAGC CCTGAAAGTC
TCGCCCGACC ATGTTCCGAG CCTGATGCTC GCCGGCATGA GCGCCCATGC GCTCGGCTCC
TACCAGGAGG CGGAAACGTA TTTCAAGCGC TTTCTGCTGC GGGTTCCCGG CCACGCGGAA
GGGCTCAAGA TGCTTGCGAC GACGCAAATC AAGTCGAAGC AATTCGACAA GGCGCTCGTC
ACGCTCGCCC CTTTCCTCGC CCCCGGGGTG CGGGATGCAC AGGGTCTGGC GCTGGCGGGC
GAAGCGCAGA TGGCCAACGG CAACCCGAGC CAGGCCGCGG CGCTCTTCGA ACGCGCGCTC
GCGCTCGAAC CCGGCAACGT CACGATACGT ACGCAGCTCG GCCTGAGTCA GCTCGCCGCC
GGGAACACGC AAGACGCCAT CGACGAGTTG ACCGATGCCT CACAGCATTC TTCGGGCTCC
CAGGCGGACA CGCTGCTTGC GGTCGCCTAT CTGAGCCGCA AGGATTACGA CCGCGCACTC
GCCGCGCTTG CGACCCTACA GAAAAAAGGC GACGCCAGCG CGAAAATCCA TCACCTGGCC
GGGCAGGCCT ACCTCGGCAA GAACGACAAG CTTGCCGCCC GCCGTAATTT CGAACAGGCG
CTCGCCGCCG ACGCGGCGTT CTTCCCCGCG GTCGCCAGCC TCGCGCAGCT CGACGTGGCC
GAGAACAAGG CGGACGCGGC CCGCATGCGT CTCGAGCGCG CGCTCGCCCA GGACAAGAAC
CGGGTCGCCG CGATGCTCGC GCTCTCGCGG ATGGCTGCCC GCAATGGTCA GGAGCAGGCC
TCGATCGACT GGCTCGAGAA AGCCGCCCGC GCCGACGGCA AGGCGATACA GCCGCGCATC
GAACTGGTAC GGCATTACCT GGCCCGCAAC GAGGGCCAGA AGGCGCTCGC CCTGGCCAAC
GAGGCGGTCC GCGCCAACCC CGACCACCCT GCCGCGCTCA ACCTGCTCGG CACGGTGCAA
CTCGCGCTCG ACGACAAGGC GAGTTCGGCG AGCACCTTCA GCCGGCTTAC CCGGGAGACT
AGGCAGTCGC CGGAAGGCTT CGTGCGTCTC GCGCAGGTGC AGCTGGCCGA CGGTAAACTC
GACGAAGCGC GCCGCAACCT GCTGCACGCG CTGGAACTCG CGCCGGGACA TCTCAAAAGC
CAGGAGGCAT TGATCAAGCT GGAACTCGCC GCCAAGCGCC CCGAGGCCGC GCTTCTCGTC
GCGCGCGACA TCCAGAAGGG CCACCCCGAT TCCGCCGTCG GCTTCGTACG CGAAGGCGAC
ATTCTGCTCG CCGAAAAACG CATCGCGCAG GCTGTGCCGG CCTACGTCCG CGCGCTGGAA
CATGGCGCCG GGCCGGCTGT GCTGGTCCAG TTCCACCGAG CGACCGTCCT CTCGGGGCAG
AACCGTGCGG CGGCCGACCG CCGGCTCGAG GACTGGATCC GGCAGCATCC GAAGGACAGC
GGCGTCGCCG CATACGCAGC CGGGTACTAC CTTGTCACCG GGCAAAGCGC GCGCGCTGCG
GAGACTTATC GGCAGATCCT GAAGCACGAA CCACGCAACG TCATGATCCT GAACAATCTC
GCCAGCCTCT ATCTGCAGCA GCGAGACCCG CGCGCGCTCG AGCTCGCGAC CCAGGCCAAC
CGACTCGCGC CGACCAACCC GGCCGTCCAG GACACCCTGG GCTGGGTTCT GGTCGAACAA
GGCCAGGCCC GGCGCGGACT CGGGTACCTG CGCAAGGCGA TGGCCCAGAC ACCGAAGAAC
GCGAGCCTGC GCTACCACCA CGCGGTGGCG CTCGCCCGCA CCGGAGACCG CCCAGGTGCG
CGCAAGCTGC TCGAGCAGCT GCTCGCCGAA ACGCCGCGCT TCGAGGAACG CGCTGCGGCG
GAGACCCTAC TCAAGAGCCT GCCGGCTGCT TCCTGA
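
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of how a GC content figure such as the 67% listed for this gene could be computed; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first row of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as printed (six blocks of 10 bases).
first_row = "ATGACTCGAC ATCCCGCTCC CCTGATTGCG TCGCTGCTGG TCCTGGCCTT TTGCGCGCCG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full sequence text to `clean_sequence` instead of `first_row` gives the value for the whole gene.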
 
Protein sequence
MTRHPAPLIA SLLVLAFCAP LAGCDPTAGL SAQEHVQRAK DFEDKGDLKG SVIELKNAIQ 
KNPDSAEARL LLGQVYLKAG FGAEAEKELR QAERLGVGRA TLEPLLGEAL LLMGEYARVL
DEIQPDTQGP KERLSRILQL RGEALLNQRK LEEACNLFQQ SYDASPGNPP TYWGLSRCAL
ATGDAAKARD WLERALKLEH KRARTWIHLG NLELAGKDTA KALAAYSKAV KIEPNNLDAL
QSLVAIHVKA GDTQRAREYL AVIRKLAPKS TRAHYLEASI AYSEKKFAEA NAAIQEALKV
SPDHVPSLML AGMSAHALGS YQEAETYFKR FLLRVPGHAE GLKMLATTQI KSKQFDKALV
TLAPFLAPGV RDAQGLALAG EAQMANGNPS QAAALFERAL ALEPGNVTIR TQLGLSQLAA
GNTQDAIDEL TDASQHSSGS QADTLLAVAY LSRKDYDRAL AALATLQKKG DASAKIHHLA
GQAYLGKNDK LAARRNFEQA LAADAAFFPA VASLAQLDVA ENKADAARMR LERALAQDKN
RVAAMLALSR MAARNGQEQA SIDWLEKAAR ADGKAIQPRI ELVRHYLARN EGQKALALAN
EAVRANPDHP AALNLLGTVQ LALDDKASSA STFSRLTRET RQSPEGFVRL AQVQLADGKL
DEARRNLLHA LELAPGHLKS QEALIKLELA AKRPEAALLV ARDIQKGHPD SAVGFVREGD
ILLAEKRIAQ AVPAYVRALE HGAGPAVLVQ FHRATVLSGQ NRAAADRRLE DWIRQHPKDS
GVAAYAAGYY LVTGQSARAA ETYRQILKHE PRNVMILNNL ASLYLQQRDP RALELATQAN
RLAPTNPAVQ DTLGWVLVEQ GQARRGLGYL RKAMAQTPKN ASLRYHHAVA LARTGDRPGA
RKLLEQLLAE TPRFEERAAA ETLLKSLPAA S
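
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a pure-Python illustration, not the database's own method: it builds the standard codon table, whose amino acid assignments translation table 11 shares (table 11 differs in which start codons are permitted), and stops at the first stop codon. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGACTCGAC ATCCCGCTCC CCTGATTGCG TCGCTGCTGG TCCTGGCCTT TTGCGCGCCG"
print(translate(first_row))  # prints: MTRHPAPLIASLLVLAFCAP
```

The output matches the first 20 residues of the protein sequence listed above (MTRHPAPLIA SLLVLAFCAP).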