Gene Acid345_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1124 
Symbol 
ID4069239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1399813 
End bp1402827 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content62% 
IMG OID637983133 
Productvitamin B12-dependent ribonucleotide reductase 
Protein accessionYP_590201 
Protein GI94968153 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0209] Ribonucleotide reductase, alpha subunit 
TIGRFAM ID[TIGR02504] ribonucleoside-diphosphate reductase, adenosylcobalamin-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0740505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.99078e-05 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCTGAAG CACCTAAATT AGCGATGAGC ACGGAAGCTC ACACCACCAC CTCCAACCCG 
ATCCCGGCCC CGAAGAAGAA AGCTCCGGGC CTGACCTTCA AGCGCGCCTT CACCAAGGCC
GGCGTTTCGC CCTACGACGA AGTCGAGTGG GAGCTTCGCA CCGCCGCCAT TACTGACGCC
CAGGGCAACA AGATCTTCGA GCAGCTCGAC GTCGAGACGC CCAAAGACTG GTCGATGACC
GCCACCAACA TCGTCGCCAG CAAGTATCTG CACGGCACGC TCGGCACCAG CGAACGCGAG
AGCGGCGTCC GCGCCCTGAT CGCCCGCGTA GCGGAGACCA TTACCCGCTG GGGCATTGAG
GGCGGCTACT TCCGCACCTC CGACGACGCT GCCATCTTCC ACGACGAGCT CGTCCATCTC
CTCGTCCAGC AGAAGATGGC CTTCAACTCT CCCGTGTGGT TCAACGTCGG TTGCGATCGC
ATTGAGCCCC AGGCCGACGG CGCCAATTGG CACTGGAATT TCCTAAAGCA GCAAGCTGAG
TTCGGACCGG TGGGTTATAC CCGTCCGCAA TGCTCCGCCT GCTTCATCAA CTCCGTGCAC
GATTCGCTCG ACAGCATCCT GACGCTTGCC AAGACCGAAG GCATGCTCTT CAAGTGGGGT
AGCGGTACCG GCACCAACCT CTCTCCGCTG CGCTCGAGCA ACGAGCAGCT CAGCGGCGGC
GGCACTGCCA GCGGTCCGCT CAGCTTCATG CGCGGGTTCG ACGCCTTCGC CGGTGTCATC
AAGTCCGGCG GTAAGACGCG TCGCGCCGCC AAGATGGTCA TCCTCAACAT CGAGCATCCC
GACATCGTTG AGTTCATCGA TTGCAAGGCC AAGGAAGAAG CCAAGGCCTG GGCGCTCATG
CAGCAGGGTT ACGACGGCTC CTCGCCTGAC GCCGAAGCTT ACAGCTCCAT TTTCTTCCAG
AACGCCAACA ACAGCGTCCG CGTGACTGAC GACTTCATGT ACGCCGTAGT CCGCGACGCC
GACTTCAGCA CCAAGGCCGT GAAGTCCGGC GATGTGGTCA AGACCTTCAA AGCCCGCTGG
CTTCTCAGCA AGATCAGCGA AGCCACCTGG CAGTGTGGCG ATCCCGGCAT GCAGTACGAC
ACCACGGTCA ATCGTTGGCA TACCAGCAAG AACACCGCGC GCATCAACGC TAGCAACCCT
TGCAGCGAGT ACATGTTCCT CGACGACTCC GCCTGCAACC TCGCTTCGCT CAACCTGATG
AAGTTCGCGC CCAACGGCAC CTTCGACGTC GCCGCCTACC GCCACGCCTG CGCCATCACC
ATTACCGCGC AGGAGATCCT CGTCGACAAC GCCGGCTACC CCACCGAATC CATCATGAAG
AACTCGCACG ACTATCGTCC GCTGGGCCTG GGCTACGCCA ACCTCGGCGC GCTGCTCATG
GCCGCGGGTC TTCCCTACGA TAGCGATGCC GGCCGCGACT ACGCTGCCTG CGTCACTGCG
ATCATGTGCG GCGAAGCGTA TCTCCAGTCA TCGAAGATCG CCGAACTCGG CCAGCCGTTA
ACGCCTGCTA CGCCGATCAC GCAATCGGTC GAGATCACTG GCAGCGCCTG CCCCGGCTGG
TATGTCAACC GCGAGCCTTT CCTCGACGTC ATTCGCATGC ACCGCGCCAG CGTCAACAAC
ATCAACCAGA AGAACGTGCC CGCGCCCGTC TTCGAAGCTG CCAAGGCCAC CTGGGACGAA
GCTCTCCAGC ACGGCGAGCG TCACGGCTAT CGCAACTCGC AGGTCACGGT CCTCGCGCCC
ACCGGCACCA TCGGCTTCAT GATGGACTGC GACACTACTG GCATCGAGCC CGATCTCGCG
CTCGTGAAAT ACAAGAAGCT GGTTGGCGGC GGCATGATCA AGATCGTGAA CAACACGGTC
CCGGCTGCAC TCTTCAAGCT CGGATACACC AGCGACCAGG CCAACGCCAC CGTCAGCTAC
ATTGACGCCA CCGGCACTAT CGAAGGCGCG CCGCACATCA AAGACGAGCA CCTCGCCGTC
TTCGATTGCT CGTTCAAGCC CGCCAAGGGC ACGCGCACCA TCTCCTACAT GGGACACGTC
AAGATGATGG CCGCGACTCA GCCCTTCATC TCCGGCGCCA TCTCCAAGAC CGTCAATCTA
CCGAAAGACG CCACCATTGA CGACATCATG GAGGCCTACA TCCAGAGCTG GAAGCTCGGC
CTGAAAGCGG TCGCCATCTA TCGCGACGGC TCCAAGGGCG GCCAGCCGCT GAATGCCTCC
GGCACTAGCA CCACAGACAA GCTAAAGAAG GCCGAAGCCG CACCGGTCGC CTCCGAGTCC
GAGGACGATG CCGCAGCTCC ACCGCGCGCC ATCCGCCATC GCCTCCCCGA CGAGCGCCTC
TCCGTCACCC ACAAATTCAA CATCGGTGGA CACGAGGGCT ACATCACCGT CGGCCTCTAC
AAAGACGGCA TGCCCGGCGA AGTCTTCATT ACCATGGCCA AGGAAGGCAG CACCGTCAGC
GGCCTCATGG ACAGCTTCGC CTGCGCCACG TCCCTCGCCC TGCAGCACGG CGTCCCGCTC
AAGCTGCTCT GCGAAAAATT CGCCCACACC CGCTTCGAGC CCAGCGGCTG GAGCCATAAT
CCCGACATCG GCTTCGCCAA GTCGATCATG GATTACATCT TCCGCTGGCT CCAACTGCGC
TTCCTAACCG GCCAGCAGCA GGCCCTGTTC GAAGGCCTCC GCCCCAAATA CATGATGGGC
ACACCGTCAT CCAGCGACGC CTCCAGTTCA TCTGTCATCC TGAGCGAAGG CGCACCAGCG
CCGGAGTCGA AGGACCCCTA TCGCTACACG CCACCCGGAG GCCCGGTCGC ACACCACGTC
CACACCGCCG ACGCCCTGAA AGACCTGATC GATCTAGGCG ATGCCCCCAG CTGCCACGTC
TGCGGCGCCA TCATGACCCG CAACGGAAGC TGCTACCGCT GCAATGAATG CGGAAGTACG
AGTGGGTGCT CGTAG
 
Protein sequence
MAEAPKLAMS TEAHTTTSNP IPAPKKKAPG LTFKRAFTKA GVSPYDEVEW ELRTAAITDA 
QGNKIFEQLD VETPKDWSMT ATNIVASKYL HGTLGTSERE SGVRALIARV AETITRWGIE
GGYFRTSDDA AIFHDELVHL LVQQKMAFNS PVWFNVGCDR IEPQADGANW HWNFLKQQAE
FGPVGYTRPQ CSACFINSVH DSLDSILTLA KTEGMLFKWG SGTGTNLSPL RSSNEQLSGG
GTASGPLSFM RGFDAFAGVI KSGGKTRRAA KMVILNIEHP DIVEFIDCKA KEEAKAWALM
QQGYDGSSPD AEAYSSIFFQ NANNSVRVTD DFMYAVVRDA DFSTKAVKSG DVVKTFKARW
LLSKISEATW QCGDPGMQYD TTVNRWHTSK NTARINASNP CSEYMFLDDS ACNLASLNLM
KFAPNGTFDV AAYRHACAIT ITAQEILVDN AGYPTESIMK NSHDYRPLGL GYANLGALLM
AAGLPYDSDA GRDYAACVTA IMCGEAYLQS SKIAELGQPL TPATPITQSV EITGSACPGW
YVNREPFLDV IRMHRASVNN INQKNVPAPV FEAAKATWDE ALQHGERHGY RNSQVTVLAP
TGTIGFMMDC DTTGIEPDLA LVKYKKLVGG GMIKIVNNTV PAALFKLGYT SDQANATVSY
IDATGTIEGA PHIKDEHLAV FDCSFKPAKG TRTISYMGHV KMMAATQPFI SGAISKTVNL
PKDATIDDIM EAYIQSWKLG LKAVAIYRDG SKGGQPLNAS GTSTTDKLKK AEAAPVASES
EDDAAAPPRA IRHRLPDERL SVTHKFNIGG HEGYITVGLY KDGMPGEVFI TMAKEGSTVS
GLMDSFACAT SLALQHGVPL KLLCEKFAHT RFEPSGWSHN PDIGFAKSIM DYIFRWLQLR
FLTGQQQALF EGLRPKYMMG TPSSSDASSS SVILSEGAPA PESKDPYRYT PPGGPVAHHV
HTADALKDLI DLGDAPSCHV CGAIMTRNGS CYRCNECGST SGCS