Gene Information |
Locus tag | Tbd_0698 |
Symbol | |
ID | 3672666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiobacillus denitrificans ATCC 25259 |
Kingdom | Bacteria |
Replicon accession | NC_007404 |
Strand | - |
Start bp | 739006 |
End bp | 740484 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637709373 |
Product | NusA antitermination factor |
Protein accession | YP_314456 |
Protein GI | 74316716 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
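The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates, an unspliced CDS, and a terminal stop codon that is not counted in the protein length (the function name `feature_lengths` is hypothetical, not part of the database):

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # both end points are counted
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp as listed in the record above.
print(feature_lengths(739006, 740484))    # (1479, 492)
```

The result agrees with the Gene Length (1479 bp) and Protein Length (492 aa) fields. The strand does not affect the lengths; it only determines which end of the interval holds the start codon.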
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.10534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.240105 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGTG AGATGTTGAT GCTGGCCGAC GCGCTGGCGC GCGAAAAGAA CGTGGACAAG GAGGTGGTGT TCGAGGCCCT CGAGCAGGCG CTCGCCTCGG CCACCAAGAA GCGCTTCAAG GAAGAGTCCG ACGTGCGCGT CGCGATCGAT CGCGAAACCG GTGACTACGA ATCCTTCCGT CGCTGGCTGG TCGTGACCGA GGCCGAGCTC GAATCCGAGG GCTACCAGAT CCTGCTGGTC GACGCCCAGG ACAAGATCCC GGACATCGAA ATCGGCGACT ACATCGAAGA GCCGCTCGAA AACGTCGAGT TCGGCCGCAT CGGTGCCCAG GCCGCCAAGC AGGTCATCCT GCAGAAGATC CGCGATGCCG AGCGCGAGCA GATCATCAGC GACTTCCTCG CCCGCAAGGA ACACCTCGTC AATGGCGTCG TGAAGCGCAT CGACCGTGGC AACGCGATCA TCGAATCCGG CCGCGTCGAA GGCTTCCTGC ACCGCGACCA GATGATCCCG CGCGAGAACC TGCGTGTCGG CGACCGCGTG CGCGCCTACC TGCTGCGCAT CGACCGCGGC AACCGCGGCC CGCAGGTCGT GCTGTCGCGC ACCGCCCCAG AATTCATCAT GAAGCTGTTC GAGCTCGAAG TGCCCGAGAT CGAGGAAGGC CTGCTCGAGA TCAAGGCCGC GGCCCGTGAT CCCGGCCTGC GCGCCAAGAT CGCCGTCGTC TCGCACGATC CGCGCATCGA CCCGATCGGC ACCTGCATCG GCCTTCGTGG GTCGCGCGTC ACCTCGGTGA CCAACGAACT CGCCGGCGAG CGCGTCGACA TCATCCACTG GTCGGCCGAT CCGGCACAGT ACGTGATCAA TGCCCTCGCG CCGGCCGAAG TCAGCTCGAT CGTCGTCGAC GAAGATACGC ACAGCATGGA CGTCGTCGTC GACGAGGAAC AACTCGCGAT GGCGATCGGC CGCGGTGGCC AGAACGTGCG CCTGGCGTCC GAACTGACCG GCTGGGAACT CAACATCATG TCGCGCGAGG CGGCTGAAGA GAAACAGTCG AGCGAAAGCC AGAAGACGCT GCAGCTCTTC ATCGAGAAGC TCGACGTCGA CGAGGAAGTC GCCCAGATTC TGGTCGACGA GGGCTTCTCC ACGCTCGAGG AAGTCGCCTA CGTGCCGCTC AACGAAATGC TCGAGATCGA AGCCTTCGAT GAAGCGCTCG TCAACGAACT GCGCAACCGG GCGCGCAACG CCCTGCTGAC CGCGGCCATC GTCGGCGAGG AGCAGGTCGA GGCCTCGGCC GGCGACCTGC TGTCGCTCGA CGGCATGGAC GCCGAAACCG CACGCTTGCT TGCCAGCAAG GGGGTCCACA CGACCGAGGA TCTGGCGGAG CTGGCGGTCG ACGAGCTGAC CGAAATGGCC GCGATGGACG CGGAACGCGC CAAACAATTG ATCATGGCCG CACGCGCGCC CTGGTTCGCC CAAGGCTAA
|
Protein sequence | MSREMLMLAD ALAREKNVDK EVVFEALEQA LASATKKRFK EESDVRVAID RETGDYESFR RWLVVTEAEL ESEGYQILLV DAQDKIPDIE IGDYIEEPLE NVEFGRIGAQ AAKQVILQKI RDAEREQIIS DFLARKEHLV NGVVKRIDRG NAIIESGRVE GFLHRDQMIP RENLRVGDRV RAYLLRIDRG NRGPQVVLSR TAPEFIMKLF ELEVPEIEEG LLEIKAAARD PGLRAKIAVV SHDPRIDPIG TCIGLRGSRV TSVTNELAGE RVDIIHWSAD PAQYVINALA PAEVSSIVVD EDTHSMDVVV DEEQLAMAIG RGGQNVRLAS ELTGWELNIM SREAAEEKQS SESQKTLQLF IEKLDVDEEV AQILVDEGFS TLEEVAYVPL NEMLEIEAFD EALVNELRNR ARNALLTAAI VGEEQVEASA GDLLSLDGMD AETARLLASK GVHTTEDLAE LAVDELTEMA AMDAERAKQL IMAARAPWFA QG
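The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the gene sequence is given 5' to 3' on the coding strand (it starts with ATG and ends with TAA, so no reverse complement is applied) and uses the standard codon assignments, which translation table 11 shares with the standard code; table 11 differs only in which codons may initiate translation. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the 10-base grouping spaces and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":                     # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGAGCCGTG AGATGTTGAT GCTGGCCGAC"
print(translate(prefix))                  # MSREMLMLAD
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.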
|
| |