Gene Acid345_3229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3229 
Symbolrho 
ID4072564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3821333 
End bp3822580 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content59% 
IMG OID637985250 
Producttranscription termination factor Rho 
Protein accessionYP_592304 
Protein GI94970256 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000548676 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0477745 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG CAGAACTGAA AGAAAAGAAC ATCACCGAGC TTACCCGCAT AGCTCGTTCG 
CTCGACCTTC CCGGCGCCAG CGGCCTCCGC AAGCAGGACC TTATCTTCAA GATCCTCCAG
GCGCAGAGCG AAAAAGAGGG CCACATCTTC GCAGAAGGTG TCCTCGAAAT CCTGCCCGAC
GGCTACGGTT TCCTCCGCTC CCCGGATTAC AACTACCTCC CCGGTCCAGA CGACATCTAC
GTCTCGCCTT CACAGATTCG CAAATTCGAC CTCAAGACCG GCGACACCAT CAGCGGACAA
GTCCGCCCGC CGCATGAAGG CGAAAAGTAC TTTGCGCTCG TCAAGATTGA AGCCGTTAAC
TTCGAATCGC CCGACGAAGC TCGCAACAAG ATTCTCTTCG ACAACCTGAC TCCGCTTTAT
CCGCAGGAGC GGATCAAACT GGAGACCGTG CGCGACAATA TCTCCGCGCG CGTGATGGAC
CTGCTCACGC CGGTGGGTAA AGGCCAGCGC GGCCTGATCG TCGCGCCGCC CCGCACCGGT
AAGACGATGC TGTTGCAGAA CCTGGCGAAC TCGATCACCA CGAACCATCC CGAGATCGTG
CTCATCGTTC TGCTGATCGA CGAGCGTCCG GAAGAAGTTA CCGACATGCA GCGCTCGGTG
AAGGGCGAGG TCATCTCCTC GACGTTTGAC GAGCCCGCTG CCCGCCACGT GCAGGTTGCG
GAAATGGTCA TCGAGAAGGC GAAGCGGCTG GTCGAGCACA AGCGCGACGT CGTCATCCTA
CTCGATTCGA TCACGCGACT GGCGCGTGCT TACAACACCA TCGTTCCGCC CTCGGGCAAA
GTGCTCTCCG GCGGTGTGGA TTCCAACGCG TTGCAGCGTC CGAAGCGTTT CTTCGGCGCA
GCCCGCAACA TCGAAGAAGG CGGCTCGTTG ACGATCATTG CCACGGCATT GATCGAAACC
GGATCGCGCA TGGACGACGT GATCTTCGAA GAGTTCAAGG GCACCGGCAA CATGGAAATC
ATTCTCGACC GGAAACTGGC GGACAAGCGC ACGTTCCCGG CGATCGATAT CCAGCGCTCC
GGCACCCGTA AGGAAGAGCT GCTGCTCGCG AAGGAAGACC TGCAACGGAT TTGGATTCTT
CGCCGCGTGC TGAACCCGCT CTCACCTGTG GAAGCGATGG AATTGCTCAT CGACAAGCTG
GGCAAGAGCC GGAACAATGG CGAGTTCCTG AGCAACATGA ACTCCTAG
 
Protein sequence
MTIAELKEKN ITELTRIARS LDLPGASGLR KQDLIFKILQ AQSEKEGHIF AEGVLEILPD 
GYGFLRSPDY NYLPGPDDIY VSPSQIRKFD LKTGDTISGQ VRPPHEGEKY FALVKIEAVN
FESPDEARNK ILFDNLTPLY PQERIKLETV RDNISARVMD LLTPVGKGQR GLIVAPPRTG
KTMLLQNLAN SITTNHPEIV LIVLLIDERP EEVTDMQRSV KGEVISSTFD EPAARHVQVA
EMVIEKAKRL VEHKRDVVIL LDSITRLARA YNTIVPPSGK VLSGGVDSNA LQRPKRFFGA
ARNIEEGGSL TIIATALIET GSRMDDVIFE EFKGTGNMEI ILDRKLADKR TFPAIDIQRS
GTRKEELLLA KEDLQRIWIL RRVLNPLSPV EAMELLIDKL GKSRNNGEFL SNMNS