Gene Acid345_4229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4229 
Symbol 
ID4073155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5011960 
End bp5013258 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content63% 
IMG OID637986260 
Productsun protein 
Protein accessionYP_593303 
Protein GI94971255 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.494405 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCAAGA ATCCCTCCCG CGCCACCGCC TTCGATATCC TTCTCCGCGT CGAGCGCGAC 
CAGGCCTTTG CCTCGGAACT GCTCCACTCC GATCGTCTCA ACGATCTCTC CGCACCAGAT
CGCGGCCTTG CCACCGAACT CGTCATGGGC ACGTTGCGCT GGCAATCCAC ACTCGACGCA
CTCGTCGCCA CGCAATCCTC ACAGCCACTC CGCAAACTCG ACATCGAGGT CCTGATCGCA
CTTCGTCTCG CGGCCTACCA ACTACAGTTT CTCGACCGCA TCCCCGCCAA CGCCGCCGTA
AACGAGAGCG TGGAATTGGT GAAGCGCGCG CGCAAACGCT CCGCCGTGCC GTTCGCCAAC
GCAGTTCTGC GCAAAATCTC CAAGCTTCCA CGCGAAATCC ATGGTGATCT CGCCCATCCT
GCGTGGCTAG TGGCGCGCTG GCGCGACAAT TACGGCGGGG ATGCCGCAGA ATCCATCTCC
AAATACGGTC AAACTACGCC GGAAACCGCG CTACGGCTCC CATTCGACGC CGAAAAACGC
GCCAAAGTTG AGGCAGAACT CCAGGAAAAC GGCGTAGAAC TCGCTCCCGG AAGGCTCCTG
AACGCCGCAC GGCGCCTCGT CAGCGGCGAC CTCAGCGGCA CCGCGGCCTT CCAACGCGGC
GACGTCTGGA TCCAGGATGA AGCCTCCCAA CTCGTCGCCC TTCTCACGGG CCACGGCGAT
CGCATTCTCG ACTGCTGCGC CGCTCCCGGC GGAAAAACTT CCGTCTTGGC CGAGCGCAAT
CCTTCGTCGA AAATTGTCGC CCTAGAACTC CACGAACAAC GTGCACGCCT ACTTCGAGAA
CGCGTTCGCG CGTCAAACGT GGATGTGCAA ACCGCCGATG CCACGAATTT CCGCGCCGAA
ACCGCGTTTG ACTGCGTCCT AGCCGACGTC CCTTGCTCCG GTACCGGCAC CCTCGCTCGC
AATCCCGAAA TCAAGTGGCG CCTAAAGCCC GAGGACCTCG CCGATCTCCA GCAACGCCAG
ATCGCCATCC TCCGCGCCGC CCTAAGCCAA CTCGCGCCCG GCGGCCGCCT CGTCTACTCG
ACATGCTCTC TCGAACCAGA AGAAGGTGAA GCCGTAGTCG AAGCCTCGCT GACCGACGAG
TTCGAACTCC AACCCGCAGC GCCCGAACTT GAGCAATTCG CACCCGCATT CGCCATCCCC
GACCCGCAGA CTCTCGTCCG CGGCCCCTAC CTTCGCACCA TCCCTGGCAT TCACCCCTGC
GAAGGATTCT TCGCCGCCGT AATCACACGT CGCCAGTGA
 
Protein sequence
MSKNPSRATA FDILLRVERD QAFASELLHS DRLNDLSAPD RGLATELVMG TLRWQSTLDA 
LVATQSSQPL RKLDIEVLIA LRLAAYQLQF LDRIPANAAV NESVELVKRA RKRSAVPFAN
AVLRKISKLP REIHGDLAHP AWLVARWRDN YGGDAAESIS KYGQTTPETA LRLPFDAEKR
AKVEAELQEN GVELAPGRLL NAARRLVSGD LSGTAAFQRG DVWIQDEASQ LVALLTGHGD
RILDCCAAPG GKTSVLAERN PSSKIVALEL HEQRARLLRE RVRASNVDVQ TADATNFRAE
TAFDCVLADV PCSGTGTLAR NPEIKWRLKP EDLADLQQRQ IAILRAALSQ LAPGGRLVYS
TCSLEPEEGE AVVEASLTDE FELQPAAPEL EQFAPAFAIP DPQTLVRGPY LRTIPGIHPC
EGFFAAVITR RQ