Gene Acid345_4768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4768 
Symbol 
ID4073362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5634043 
End bp5635260 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content59% 
IMG OID637986812 
ProductSAM-dependent methyltransferase 
Protein accessionYP_593841 
Protein GI94971793 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000059274 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0101611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCCCT CCAAAACTGC TACCGCGAAT CCGCCGATCC ACCTGACCCC GCGCGGCGCT 
GCCCGCATTC GTGCCGGACA CGTGTGGGTT TACCGCTCGG ACTTGAAGGA CCAAAAGCCC
AAAGCCGAGC CGGGATCGCT GGTCGCAGTA CTCGATGAAA AGGGCCGCAT CCTTGGTGAT
GCCCTCTACA GCTCAAGCTC ACAGATCGCC ATTCGCATGT TTGCCGTGGG CCGCGAAGAG
ACCTACGCAG TTGAGCTGCC GGAAATCGTC AGCAACCGCG TGGCCGCCGC CATTTCCTAT
CGCGACGAAG TCTTGCAGCA GACTGAGGCT TGCCGGCTGG TGTTCAGCGA GGCCGATTTC
CTGCCCGGTC TTATCGTCGA TCGTTATAAC GACATCCTTA CCATGCAAGT GCTGACCCAG
GCGATGGACC GCGAGGACCT GCGTCGTGCG GTGATGGACT CGCTGCGGGA GCACTTCCCT
AACTCAACGA TCTTTGAGCG CGTAGATGAG CGCATCCGCG AACTTGAGAA ATTGCCGGCG
AAGGAATCGG GTGTTGTCGG CAAGCCACAG AAGGGGAACG CAAAGACCGC GACCATCTTC
AATATGAATG GCCTGCGCTT TCACTATGAT GTGAGCGGCC AGAAGACTGG CGCCTTCCTC
GACCAGCGTG AGAATTACGC CGCAGCCGCG CAATACGCGC GTGGAGACGC CCTCGACGTC
TTCACCTACC AGGGCGGGTT CGCATTGCAC CTGTACCAGA ATTGCCGTCG CGTCACGGGT
GTGGATATGT CGCGTCCAGC GCTTGAAGTT GCCGAGGAGA ATGCGACGCT CAATGAGTAC
AACCTTGAGT GGATTGAAGC TAACGCCTTC GATTATCTAA AAGACCAGTC CGTCGGTGGA
CGCGCTTACG ACACCATCGT TCTGGATCCG CCGGCGTTTG CCAAGACAGC TAAGAACTTC
GATACAGCGA TCCGCGGGTA CAAAGAGCTC AATCTTCGCG CACTGAAGAT GCTGCGTGCC
GGCGGTACGC TTATCAGCTG TTCCTGCTCG TTTCACGTCA GCGAAGCGGA TTTCATGGAA
ATGCTCGGCA CGGCCGCGGC CGACGCTCAT CGCCGCGTGC GCATCCTCGA GAAGCGCAAC
GCCGCCAAGG ACCATCCAAT TCTCATGGGT GTCCCTGAGA CCAATTACCT CAAGTGCATC
ATCGCGCGAG TGGACTAG
 
Protein sequence
MKPSKTATAN PPIHLTPRGA ARIRAGHVWV YRSDLKDQKP KAEPGSLVAV LDEKGRILGD 
ALYSSSSQIA IRMFAVGREE TYAVELPEIV SNRVAAAISY RDEVLQQTEA CRLVFSEADF
LPGLIVDRYN DILTMQVLTQ AMDREDLRRA VMDSLREHFP NSTIFERVDE RIRELEKLPA
KESGVVGKPQ KGNAKTATIF NMNGLRFHYD VSGQKTGAFL DQRENYAAAA QYARGDALDV
FTYQGGFALH LYQNCRRVTG VDMSRPALEV AEENATLNEY NLEWIEANAF DYLKDQSVGG
RAYDTIVLDP PAFAKTAKNF DTAIRGYKEL NLRALKMLRA GGTLISCSCS FHVSEADFME
MLGTAAADAH RRVRILEKRN AAKDHPILMG VPETNYLKCI IARVD