Gene Acid345_4471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4471 
Symbol 
ID4070954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5305801 
End bp5306643 
Gene Length843 bp 
Protein Length280 aa 
Translation table11 
GC content61% 
IMG OID637986510 
ProductHemK family modification methylase 
Protein accessionYP_593545 
Protein GI94971497 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.44182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTCA AGCAAGCCTT CGACTCCGCA CTTAAGCATT TAGAAGCAGC CGACACTCCT 
TCCCCTCGCC TGAGCGCCGA GCTCTTGCTG ATGTTCAGTT TGAATTGCGA TCGCGCTTAT
CTCTTTACCT ATCCCGAGCG CGAACTCACC GCCGACGAAC AGGCCCGCTA CGACGAAGCC
ATCGCCCGCC GCTGTCATGG CGAGCCCGCG CAATACATCA CCGGACACCA GGAGTTCTAT
GGTCGCGACT TCCTCGTCTC GCCGGCGGTG CTCATCCCGC GCCCTGAAAC CGAGCACCTG
ATCGAAGCCG TGCTCGAACT CGCGCCACGC GAGGTGCGTT GGGAAGTCCT CGATGTTGGA
ACCGGCTCCG GCTGCATTGC CGCAACGCTT GCCAAAGAAT TTCCGCGGAT GAAAGTCACG
GCCGTCGATA TCTCGCCCGA AGCGCTCCAG ATTGCACAAG CCAATGCCGC CCGCCTCGAA
GCTCAAGTCG AGTTTCGTGT GAGCGATCTA CTCAGCGCGA TCGAACCCGG ACGCCAGTTC
GACATGATCG TCTCCAACCC GCCCTACGTC GGCGAGTGCG AGGCTGACAA AGTCCAGCGC
CAGGTGAAAG ACTTCGAGCC GCACTGCGCC GTCTTCGGCG GCGAGCGCGG CATGGACATC
ATCAAGCGTC TGGCGCCGCA GGTTTGGGAG CACCTCAAAC CGGGCGGCTG GTTCCTAATG
GAAATCGGGT ACTCCATCGC CGATCCCGTC CACGAAATCA TGCGCGACTG GACCAACTTC
AAGGTCGTCC CCGACTTGCG AGGCATCCCG CGCGTTGTCG TCGGCCGCAA ACCAACTTCT
TAA
 
Protein sequence
MTLKQAFDSA LKHLEAADTP SPRLSAELLL MFSLNCDRAY LFTYPERELT ADEQARYDEA 
IARRCHGEPA QYITGHQEFY GRDFLVSPAV LIPRPETEHL IEAVLELAPR EVRWEVLDVG
TGSGCIAATL AKEFPRMKVT AVDISPEALQ IAQANAARLE AQVEFRVSDL LSAIEPGRQF
DMIVSNPPYV GECEADKVQR QVKDFEPHCA VFGGERGMDI IKRLAPQVWE HLKPGGWFLM
EIGYSIADPV HEIMRDWTNF KVVPDLRGIP RVVVGRKPTS