Gene Acid345_4218 details

Gene Information

Locus tag: Acid345_4218
Symbol:
ID: 4073144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4995623
End bp: 4997227
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 60%
IMG OID: 637986249
Product: NusA antitermination factor
Protein accession: YP_593292
Protein GI: 94971244
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
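
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 4995623
END_BP = 4997227


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the computed values agree with the listed Gene Length (1605 bp) and Protein Length (534 aa).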


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000184689
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.152678
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGCG AACTCTACAA CGTAATTGAC GCGCTCAGCC GCGAAAAGGG CATTGACCCG 
CAGGTCGTCG TGACTGCGGT CGAGGACGCC ATCGTTGTTG CCACCCGTAA GTTCTATAAG
ACGGGGGAAA ACTTCCGCGC CGTGCTCGAC AAAGAGTCGG GCCAGATCCG CGCTTATGCC
GTTCGCCAGG TAGTGATTAA CGAAGACGAA CTCGAGGATC CCGCGACCCA GGTTCCGCTA
GAAGAAGCAC GTGAGCTCGA TCCAGCAGCT GAAGTCGGCG GCGAACTGCT GATTGAGAAG
AAGACCGACA TGTTGGGGCG CATCGCGGCA CAGTTGGCGA AGCAGGTCAT CTTCCAGAAG
GTTCGCGAAG CTGAGCGCGA TACGGTTTAC AACGAGTACA TCGGGCGCGT GGGTGAAATC
GTGAACGCCA CGATGAAGCG CAATGAAGGG CCGGACTTGA TTTGGGACAT CGGCAAGGCG
GAGGCCCGCA TGCCGAAGAA GGAACAGTCG CGCCTTGAGT CGTTTGCCAT CGGCGAGCGG
GTTCGCGTGG TTATCACCCG CGTCGAGAAG GCCTCCAAGG GGCCGCAGGT TATCGTGTCG
CGTGCAGCTC CGGAACTGGT ATCGCACCTC TTCCAGACGG AAGTGCCAGA AATTTACGAC
AACACCGTCG TGATCCGCGC CATCGCTCGT GAAGCCGGTG AGCGCACCAA GATCGCCGTG
ATGTCAAAGG ACAAGGATGT GGATGCGGTC GGCGCTTGCG TCGGTATGAA GGGCATGCGC
GTGCAGTCGA TCATCCGCGA ACTGCGCGGA GAGAAGATCG ACATCATCGA GTACCACGAA
GACGCCGTTA CTTTCGCGGA GAAGGCGTTG CAGCCGGCGA AGGTCAGCCG TGTCACCATC
CTCGAATCGG GCGACAAGCA TCTCGAAGTG ATCGTCGACG ACACCCAGCT CTCGCTTGCC
ATCGGCAAGA AGGGTCAGAA CGTTCGTCTC GCGGCCAAGC TGCTGGGGTG GAAGATCGAC
ATTAAGAGTG AGGAAGAGAA GCGCCAGGAA GTTGAGCAGC AGATGTCGGC ACTGGTTAAT
CCGAGCATCA CGCCGCTGGA CAAGGTGCCA GATCTCGGCG AGGCCATCAT CGAGAAGCTC
TCGGCTGCCG GTATCAATAG CGTGGAAGCT CTGGCCGATA TGACGCCGGA GCAACTCGAA
GAGGTTCCGG GAATCGGACC GAAGACGGTG GACAAGATCT TCGTTGCGGT GAACGCGTAC
TTCTCGGCAC TCGATGCCGC GGCGGAAGCA GCTGAGGCTG CGTCCGCGGA AGGCGCAACC
ACCGAACTTA GCGCGAGTGA CACGACCGAC AACCAGGCGC AAGCCGATGA ACTGGGCAAT
CGCGAAGAGC TCACCGGATC CGCGGGACAG GCCGGTGACG ATGCGGCCGT TTCGGGTACG
CCCGAGGCAA GTGAAGAGAG CGTCAAGAAC CTGGTAGATA CAGAACAGAG TTACGAAGCA
GCGGCAGTCA GTGGCGTTGA GAACGCGCCG CCGGCTGACG AGGCGGAAGT CACCACGCAC
GGCGAACAAC CGGGCGAGGA CGACCTTCCG GCAGAAGAAA AGTAG
 
Protein sequence
MASELYNVID ALSREKGIDP QVVVTAVEDA IVVATRKFYK TGENFRAVLD KESGQIRAYA 
VRQVVINEDE LEDPATQVPL EEARELDPAA EVGGELLIEK KTDMLGRIAA QLAKQVIFQK
VREAERDTVY NEYIGRVGEI VNATMKRNEG PDLIWDIGKA EARMPKKEQS RLESFAIGER
VRVVITRVEK ASKGPQVIVS RAAPELVSHL FQTEVPEIYD NTVVIRAIAR EAGERTKIAV
MSKDKDVDAV GACVGMKGMR VQSIIRELRG EKIDIIEYHE DAVTFAEKAL QPAKVSRVTI
LESGDKHLEV IVDDTQLSLA IGKKGQNVRL AAKLLGWKID IKSEEEKRQE VEQQMSALVN
PSITPLDKVP DLGEAIIEKL SAAGINSVEA LADMTPEQLE EVPGIGPKTV DKIFVAVNAY
FSALDAAAEA AEAASAEGAT TELSASDTTD NQAQADELGN REELTGSAGQ AGDDAAVSGT
PEASEESVKN LVDTEQSYEA AAVSGVENAP PADEAEVTTH GEQPGEDDLP AEEK
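
The sequence blocks above are laid out in space-separated groups of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how one might strip the layout, translate the gene sequence, and compute its GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard one; translation table 11 uses the same amino acid assignments and differs only in which codons may act as starts, which this sketch does not model.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record above.
    first_line = (
        "ATGGCAAGCG AACTCTACAA CGTAATTGAC GCGCTCAGCC GCGAAAAGGG CATTGACCCG"
    )
    print(translate(first_line))
    print(round(gc_fraction(first_line), 2))
```

Pasting the full gene sequence into `translate` should reproduce the listed protein sequence if the record is internally consistent; the first sixty bases translate to the first twenty residues shown above.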