Gene Tbd_0698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_0698 
Symbol 
ID3672666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp739006 
End bp740484 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content65% 
IMG OID637709373 
ProductNusA antitermination factor 
Protein accessionYP_314456 
Protein GI74316716 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.10534 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.240105 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGTG AGATGTTGAT GCTGGCCGAC GCGCTGGCGC GCGAAAAGAA CGTGGACAAG 
GAGGTGGTGT TCGAGGCCCT CGAGCAGGCG CTCGCCTCGG CCACCAAGAA GCGCTTCAAG
GAAGAGTCCG ACGTGCGCGT CGCGATCGAT CGCGAAACCG GTGACTACGA ATCCTTCCGT
CGCTGGCTGG TCGTGACCGA GGCCGAGCTC GAATCCGAGG GCTACCAGAT CCTGCTGGTC
GACGCCCAGG ACAAGATCCC GGACATCGAA ATCGGCGACT ACATCGAAGA GCCGCTCGAA
AACGTCGAGT TCGGCCGCAT CGGTGCCCAG GCCGCCAAGC AGGTCATCCT GCAGAAGATC
CGCGATGCCG AGCGCGAGCA GATCATCAGC GACTTCCTCG CCCGCAAGGA ACACCTCGTC
AATGGCGTCG TGAAGCGCAT CGACCGTGGC AACGCGATCA TCGAATCCGG CCGCGTCGAA
GGCTTCCTGC ACCGCGACCA GATGATCCCG CGCGAGAACC TGCGTGTCGG CGACCGCGTG
CGCGCCTACC TGCTGCGCAT CGACCGCGGC AACCGCGGCC CGCAGGTCGT GCTGTCGCGC
ACCGCCCCAG AATTCATCAT GAAGCTGTTC GAGCTCGAAG TGCCCGAGAT CGAGGAAGGC
CTGCTCGAGA TCAAGGCCGC GGCCCGTGAT CCCGGCCTGC GCGCCAAGAT CGCCGTCGTC
TCGCACGATC CGCGCATCGA CCCGATCGGC ACCTGCATCG GCCTTCGTGG GTCGCGCGTC
ACCTCGGTGA CCAACGAACT CGCCGGCGAG CGCGTCGACA TCATCCACTG GTCGGCCGAT
CCGGCACAGT ACGTGATCAA TGCCCTCGCG CCGGCCGAAG TCAGCTCGAT CGTCGTCGAC
GAAGATACGC ACAGCATGGA CGTCGTCGTC GACGAGGAAC AACTCGCGAT GGCGATCGGC
CGCGGTGGCC AGAACGTGCG CCTGGCGTCC GAACTGACCG GCTGGGAACT CAACATCATG
TCGCGCGAGG CGGCTGAAGA GAAACAGTCG AGCGAAAGCC AGAAGACGCT GCAGCTCTTC
ATCGAGAAGC TCGACGTCGA CGAGGAAGTC GCCCAGATTC TGGTCGACGA GGGCTTCTCC
ACGCTCGAGG AAGTCGCCTA CGTGCCGCTC AACGAAATGC TCGAGATCGA AGCCTTCGAT
GAAGCGCTCG TCAACGAACT GCGCAACCGG GCGCGCAACG CCCTGCTGAC CGCGGCCATC
GTCGGCGAGG AGCAGGTCGA GGCCTCGGCC GGCGACCTGC TGTCGCTCGA CGGCATGGAC
GCCGAAACCG CACGCTTGCT TGCCAGCAAG GGGGTCCACA CGACCGAGGA TCTGGCGGAG
CTGGCGGTCG ACGAGCTGAC CGAAATGGCC GCGATGGACG CGGAACGCGC CAAACAATTG
ATCATGGCCG CACGCGCGCC CTGGTTCGCC CAAGGCTAA
 
Protein sequence
MSREMLMLAD ALAREKNVDK EVVFEALEQA LASATKKRFK EESDVRVAID RETGDYESFR 
RWLVVTEAEL ESEGYQILLV DAQDKIPDIE IGDYIEEPLE NVEFGRIGAQ AAKQVILQKI
RDAEREQIIS DFLARKEHLV NGVVKRIDRG NAIIESGRVE GFLHRDQMIP RENLRVGDRV
RAYLLRIDRG NRGPQVVLSR TAPEFIMKLF ELEVPEIEEG LLEIKAAARD PGLRAKIAVV
SHDPRIDPIG TCIGLRGSRV TSVTNELAGE RVDIIHWSAD PAQYVINALA PAEVSSIVVD
EDTHSMDVVV DEEQLAMAIG RGGQNVRLAS ELTGWELNIM SREAAEEKQS SESQKTLQLF
IEKLDVDEEV AQILVDEGFS TLEEVAYVPL NEMLEIEAFD EALVNELRNR ARNALLTAAI
VGEEQVEASA GDLLSLDGMD AETARLLASK GVHTTEDLAE LAVDELTEMA AMDAERAKQL
IMAARAPWFA QG