Gene Tbd_2653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_2653 
Symbol 
ID3672382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp2726202 
End bp2729333 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content69% 
IMG OID637711359 
Producthypothetical protein 
Protein accessionYP_316411 
Protein GI74318671 
COG category[S] Function unknown 
COG ID[COG3002] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTGG CACTCGGGCG GCGCCTCAAG ATTCGTGCTC TGGTCCATGT TGCGGGCGAA 
CCGATCCCGT ATTTCTGGCC GATGCGCACC TTCATCCACC ACAATCCGCT CTACGGCCTC
GAGCATCTGC CGTTCGAGCA GGCCGTCGCG ATGGGCGAGC GGCTGTTCCG TGCCCACGGC
TTCCTGCCGC GCGCGCGCCA GCAGGCCTAC CTCGCCGCCG GCCGGGTCGA CGCCACGGTG
CTCGCGGCGC AGGTCGCGCG CTTTTGCGCG GACCAGCCCG AGGTCGCCGG CCTCGACCTG
GAGCGCCTGC TCCTGACGCT CCTGACCGAC GTCGAAACGC CGCTCGGTGC GCCGCCCACG
CTCGCCGACG CGGCCGACGT CCACGCCGTG CTCAGGGGCG CCGCGCTGCC GGCGCGCGAG
ATCCCGAGCG GGGCGCTGGC GGCCCAGGTC GGCAGCGACA TGCCGCCCGG GCGGCCGCTC
TACGCCATGC TCGATCTGCT GTTCGGCACC GAGATCGGCG CGACGCTCGA CGAACTGGTG
ATCAAGAGTT GCCTCGATTT CTTCGACGAG GGCCAGTCGG TCTGGCAGAT GCCCGGCCGC
GAGCAGGGTC TGTTCAGGGC CTGGAGCGCG GTCGCGCGGC GCAATCTGCG CCTCTTCATC
CGCGGCCTGC ACATCAAGCG CATCCTCGCC GTCGACGACA CGCCCGAAGG CATCATCAGC
CACGTCATGG GCGAACTCGG CGTGCCCGAG GACGACTGGA TGAACCACTT CACCTGCGAG
CTGACGCGGC TGCATGGCTG GGCCGGGTTC ATCCGCTGGC GTTCGGGCGC CAAGCACTAC
CACTGGACGC GCCGCTACCC GGCCGACCTC GTCGACTACC TCGCGATCCG GCTCGTGCTC
GGTCTCGCGT TGCTGCGCGA GCACGCGGCG CGCGCGGGCA CGCCGGCCAA CCTCGCCGAA
CTCGCGCGCC GCGTCGAATC GAATCCGGCC GAAGCCTATC TGTGCCGCGA ATTCCATGGC
GGGCGGGTGC TGCCAGAGAT GGCGCACGCG GTCGAGGACG CGATCGCTGC GCGCAGGCCG
GCGCGCACCG CGCGGCTACT GCCGCGCTAT CTCGAGCGCA AGCGCGAAAT CGAGGCGCGC
CGTCACGCCC AGTCCCTGAC TCGGCTCGCC GAGCGCGCCG GGATGGGCGC CGCCTTGCAG
CGGCTCGCGC CGGACGACCT CGCGCGCCTG ACCGCGCTGC TCGCCCGCTT CGAGGACGAA
GAGGGGCGCA TGTGGCTCGC TGCCCGCGAA GCGCACTACA TGGGCCGCTT GCTGCCCTGC
CTCGACCTCG CCCCGCCGGC GCCGCCCGAG AAGCGGCCGT TCGCGCAGGT GATGTTCTGC
ATCGACGTGC GCTCGGAGCG CATTCGCCGC CATCTCGAAA AGCTCGGCAG TTATCAGACC
TTCGGCATCG CCGGCTTCTT CGGCGTGCCG GTGAGTTTCA TCGGCCTCGA GAAGGGCAGT
GAGACGCATC TCTGCCCGGT CGTCGCGACG CCGAAGAACG TCGTGCTCGA ACTCGCGATC
ACGCGCAACG CCGACGACGA GGCCTTCGTC TCGACGCTCG AGCAGGTGTT CCACGAGCTG
AAGGCCTCGG TGCTGTCGCC CTTCATCACG GTCGAAGCGA TCGGCCTGCT GTTCGGCCTC
GACATGTTCG GCAAGAGCCT CGCGCCGCTC GCCTACAGCC GCTGGCGCGA GCGGCTGCAC
CCGGACAAGC CTGACACCCG CCTGCTCCTC GACAAGCTGT CGCGCGAGCA GGCCGAGTCG
ATCATCCGAT CGCTGCAGCG CGCGCTGATC GTGAAGGCGG TGCGCCGCGA ACTCGGCATC
CCGCGCGAGT TGCTGACCGA CGAGATGATC CGCGAACTGC GCGAAACTGC GCTCGGCAAC
CAGGCCCAGG CCGCCGGCTT CGCGCAGCGC TTCGAGCTCG ACTGCGACGC CGAAACCGGC
TTCGTCGAAC GGCTGCGGCA GGTCTACCGC ATCGACCGGG GCTACGCCCG GCTCCAGCTC
GAGCGCCTCG GACGCATCGG CTTCACGCTC GACGAGCAGG TGCATTTCGT CGGCCAGGCA
CTGCGCTCGA TCGGCCTCGT CTCGGGCTTC TCGCGCTTCG TCCTGCTCAC CGGGCACGGC
AGCACCTCGG AGAACAATCC CTACGAATCC GCGCTCGACT GCGGCGCCTG CGGCGGCAAT
CACGGCATCA CCAACGCCCG CGTGCTGGCG CAGATCGCCA ACAAGACGGC CGTGCGTGCG
CGCCTGCGCG AGCAGGGCAT CGTCATTGCC GACGACACCT GGTTCGTCCC GGCCTTCCAC
AACACGACGA CCGATGAGCT GCGCCTTTAC GACCTCGATC TGCTGCCGCC GAGCCATCTC
GTCTATACCG AGCGCCTGAT CAACGGCCTG CAGGCGGCCT CGCATCTGTG CGCGGCCGAG
CGCATGCGCA CGCTGCAGGA CACCCCGGGC GACGCCGATG AAAACGGCGA CTCGGCGGGC
GCCTATCGCC TGGCGCGCCG CAACGCCCTG GACTGGTCGC AGGTGCGCCC GGAATGGGGT
CTGGCTCGCA ACGCCGCCTT CGTCATCGGC CGCCGCGACG CGACCGGGGG GCTCGACCTT
GAGGGCCGTG TGTTCCTGCA TTCCTACGAC TACCGCTGCG ATCCCAGGGG ACGCCTGCTC
GAGAACATCC TCGCGGGCCC GCTCGTGGTC GGCCAGTGGA TCAACATGGA GCACTATTTC
TCGGCCGTCG ACAACGCGCA CTACGGCAGC GGCAGCAAGG TCTATCACAA CATCGCCGGC
CGCTTCGGCG TGATGACCGG AAACCTCTCC GACCTGCGCA CCGGACTGCC GGCGCAGACC
GTGCTCAAGG ACAGCGCGCC GTATCACGAG CCGCTACGCC TGTTGACCGT GATCGAGGCG
CCTTTCGCGC ACGCCCGGGC CGCGGTCGAG GGCGTCGTGA AGGTCAAGAA TCTCATGCAC
AACGGCTGGC TGCGCATGGC CGTGGTGGAC CCCGAAACCC GTTTTGCCTA CGTATTCGAG
GACGGCGGCT GGCGGCAGTA TCCGCACGAT GCCGTCAGCG AAGCCGTCGA AGAAAAGGAG
ACCGTGCTGT GA
 
Protein sequence
MTLALGRRLK IRALVHVAGE PIPYFWPMRT FIHHNPLYGL EHLPFEQAVA MGERLFRAHG 
FLPRARQQAY LAAGRVDATV LAAQVARFCA DQPEVAGLDL ERLLLTLLTD VETPLGAPPT
LADAADVHAV LRGAALPARE IPSGALAAQV GSDMPPGRPL YAMLDLLFGT EIGATLDELV
IKSCLDFFDE GQSVWQMPGR EQGLFRAWSA VARRNLRLFI RGLHIKRILA VDDTPEGIIS
HVMGELGVPE DDWMNHFTCE LTRLHGWAGF IRWRSGAKHY HWTRRYPADL VDYLAIRLVL
GLALLREHAA RAGTPANLAE LARRVESNPA EAYLCREFHG GRVLPEMAHA VEDAIAARRP
ARTARLLPRY LERKREIEAR RHAQSLTRLA ERAGMGAALQ RLAPDDLARL TALLARFEDE
EGRMWLAARE AHYMGRLLPC LDLAPPAPPE KRPFAQVMFC IDVRSERIRR HLEKLGSYQT
FGIAGFFGVP VSFIGLEKGS ETHLCPVVAT PKNVVLELAI TRNADDEAFV STLEQVFHEL
KASVLSPFIT VEAIGLLFGL DMFGKSLAPL AYSRWRERLH PDKPDTRLLL DKLSREQAES
IIRSLQRALI VKAVRRELGI PRELLTDEMI RELRETALGN QAQAAGFAQR FELDCDAETG
FVERLRQVYR IDRGYARLQL ERLGRIGFTL DEQVHFVGQA LRSIGLVSGF SRFVLLTGHG
STSENNPYES ALDCGACGGN HGITNARVLA QIANKTAVRA RLREQGIVIA DDTWFVPAFH
NTTTDELRLY DLDLLPPSHL VYTERLINGL QAASHLCAAE RMRTLQDTPG DADENGDSAG
AYRLARRNAL DWSQVRPEWG LARNAAFVIG RRDATGGLDL EGRVFLHSYD YRCDPRGRLL
ENILAGPLVV GQWINMEHYF SAVDNAHYGS GSKVYHNIAG RFGVMTGNLS DLRTGLPAQT
VLKDSAPYHE PLRLLTVIEA PFAHARAAVE GVVKVKNLMH NGWLRMAVVD PETRFAYVFE
DGGWRQYPHD AVSEAVEEKE TVL