Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4218 |
Symbol | |
ID | 4073144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4995623 |
End bp | 4997227 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986249 |
Product | NusA antitermination factor |
Protein accession | YP_593292 |
Protein GI | 94971244 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000184689 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.152678 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGCG AACTCTACAA CGTAATTGAC GCGCTCAGCC GCGAAAAGGG CATTGACCCG CAGGTCGTCG TGACTGCGGT CGAGGACGCC ATCGTTGTTG CCACCCGTAA GTTCTATAAG ACGGGGGAAA ACTTCCGCGC CGTGCTCGAC AAAGAGTCGG GCCAGATCCG CGCTTATGCC GTTCGCCAGG TAGTGATTAA CGAAGACGAA CTCGAGGATC CCGCGACCCA GGTTCCGCTA GAAGAAGCAC GTGAGCTCGA TCCAGCAGCT GAAGTCGGCG GCGAACTGCT GATTGAGAAG AAGACCGACA TGTTGGGGCG CATCGCGGCA CAGTTGGCGA AGCAGGTCAT CTTCCAGAAG GTTCGCGAAG CTGAGCGCGA TACGGTTTAC AACGAGTACA TCGGGCGCGT GGGTGAAATC GTGAACGCCA CGATGAAGCG CAATGAAGGG CCGGACTTGA TTTGGGACAT CGGCAAGGCG GAGGCCCGCA TGCCGAAGAA GGAACAGTCG CGCCTTGAGT CGTTTGCCAT CGGCGAGCGG GTTCGCGTGG TTATCACCCG CGTCGAGAAG GCCTCCAAGG GGCCGCAGGT TATCGTGTCG CGTGCAGCTC CGGAACTGGT ATCGCACCTC TTCCAGACGG AAGTGCCAGA AATTTACGAC AACACCGTCG TGATCCGCGC CATCGCTCGT GAAGCCGGTG AGCGCACCAA GATCGCCGTG ATGTCAAAGG ACAAGGATGT GGATGCGGTC GGCGCTTGCG TCGGTATGAA GGGCATGCGC GTGCAGTCGA TCATCCGCGA ACTGCGCGGA GAGAAGATCG ACATCATCGA GTACCACGAA GACGCCGTTA CTTTCGCGGA GAAGGCGTTG CAGCCGGCGA AGGTCAGCCG TGTCACCATC CTCGAATCGG GCGACAAGCA TCTCGAAGTG ATCGTCGACG ACACCCAGCT CTCGCTTGCC ATCGGCAAGA AGGGTCAGAA CGTTCGTCTC GCGGCCAAGC TGCTGGGGTG GAAGATCGAC ATTAAGAGTG AGGAAGAGAA GCGCCAGGAA GTTGAGCAGC AGATGTCGGC ACTGGTTAAT CCGAGCATCA CGCCGCTGGA CAAGGTGCCA GATCTCGGCG AGGCCATCAT CGAGAAGCTC TCGGCTGCCG GTATCAATAG CGTGGAAGCT CTGGCCGATA TGACGCCGGA GCAACTCGAA GAGGTTCCGG GAATCGGACC GAAGACGGTG GACAAGATCT TCGTTGCGGT GAACGCGTAC TTCTCGGCAC TCGATGCCGC GGCGGAAGCA GCTGAGGCTG CGTCCGCGGA AGGCGCAACC ACCGAACTTA GCGCGAGTGA CACGACCGAC AACCAGGCGC AAGCCGATGA ACTGGGCAAT CGCGAAGAGC TCACCGGATC CGCGGGACAG GCCGGTGACG ATGCGGCCGT TTCGGGTACG CCCGAGGCAA GTGAAGAGAG CGTCAAGAAC CTGGTAGATA CAGAACAGAG TTACGAAGCA GCGGCAGTCA GTGGCGTTGA GAACGCGCCG CCGGCTGACG AGGCGGAAGT CACCACGCAC GGCGAACAAC CGGGCGAGGA CGACCTTCCG GCAGAAGAAA AGTAG
|
Protein sequence | MASELYNVID ALSREKGIDP QVVVTAVEDA IVVATRKFYK TGENFRAVLD KESGQIRAYA VRQVVINEDE LEDPATQVPL EEARELDPAA EVGGELLIEK KTDMLGRIAA QLAKQVIFQK VREAERDTVY NEYIGRVGEI VNATMKRNEG PDLIWDIGKA EARMPKKEQS RLESFAIGER VRVVITRVEK ASKGPQVIVS RAAPELVSHL FQTEVPEIYD NTVVIRAIAR EAGERTKIAV MSKDKDVDAV GACVGMKGMR VQSIIRELRG EKIDIIEYHE DAVTFAEKAL QPAKVSRVTI LESGDKHLEV IVDDTQLSLA IGKKGQNVRL AAKLLGWKID IKSEEEKRQE VEQQMSALVN PSITPLDKVP DLGEAIIEKL SAAGINSVEA LADMTPEQLE EVPGIGPKTV DKIFVAVNAY FSALDAAAEA AEAASAEGAT TELSASDTTD NQAQADELGN REELTGSAGQ AGDDAAVSGT PEASEESVKN LVDTEQSYEA AAVSGVENAP PADEAEVTTH GEQPGEDDLP AEEK
|
| |