Gene Acid345_3426 details


Gene Information

Locus tag: Acid345_3426
Symbol:
ID: 4070310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4046294
End bp: 4048147
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 62%
IMG OID: 637985448
Product: bifunctional homocysteine S-methyltransferase/5,10-methylenetetrahydrofolate reductase protein
Protein accession: YP_592501
Protein GI: 94970453
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0646] Methionine synthase I (cobalamin-dependent), methyltransferase domain
        [COG0685] 5,10-methylenetetrahydrofolate reductase
TIGRFAM ID:
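
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 4046294
end_bp = 4048147

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1854 617
```

Under these assumptions the result agrees with the listed Gene Length (1854 bp) and Protein Length (617 aa).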


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.41819
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGAAGT CATTTCTAGA GCGCCTTGGG GATGGGCCCA TCCTTTGCGA TGGCGCCATG 
GGTACGCTGC TGTACTCCAA GGGCATTTTC ATCAATCGCT GTTACGACGA GCTGAACCTC
TCGCAGCCCG AACTGATCGG CGGGATCCAC GCCGACTATG TCGCGAACGG CGCTGAAATC
CTTGAAACCA ACACCTTCGG CGCGAACTCG TTCCGGTTGG CTCGTCACGG CTGCCAGGAG
AAACTCGCGG ACATCAACCG CGCCGGCGTA GAACTGGTTC GCAAAGCAAT CAAGAACAAT
CAGGTGTATG CGGCCGGCGC GGTTGGGCCG CTCGGCATTC GCATCGAGCC GCTCGGCAAG
ACCAGCCGTG ACGAAGCACG CGATGCGTTC CGCGATCAGA TTCGCGTACT TGTGGATTCA
GGCGTCGACC TGCTGATCCT CGAGACCTTC GGATATCTTG GCGAACTGCA CCAGGCGATC
CTCGCGGCGC GCGATGTCGA TCCCAAAATT CCGGTGGTTG CGCAGGTCAC CATCGACGAA
GACGGCAACT GCCTCGACGG CTCCAGTCCT GAACATTACG GCGCGCGCTT GACGGAGTGG
GGCGCCGACG TCATCGGCTG CAACTGCAGC GTCGGCCCGG TCGCGATGCT CGACGCGCTT
GAGCGCCTCC GCGCGGCAAC ACCGAAGCCG CTCTCCGCGC AACCAAACGC AGGAATTCCG
CGCTCGGTCG AGGGCCGCAA CATTTATCTC TGTTCGCCGG AATACATGGC GAGTTATGCG
CGCAAGTTCG TCGGCGTGGG CGTGACGCTC GTCGGCGGCT GTTGCGGAAC TACGCCGGAC
CACATCCGCT CCATGAAGTC GGCCCTGCGC GTGGGTGAGG CGCGGACCAC TTCTTTTAAG
GTGAAGACCG AGGTGGAAGA ACACGCCGTT GAGCCAACGC CGCTCGGACA GCGTTCTCGT
GTCGGCGCCC GCATTGCGAG TGGCGAATTC CTCACGCTGG TCGAAATTGT TCCGCCTAAG
GGCGTGAACG CCGAGAAAGA AGTCGAAGGC GCTCGATACC TGATGTCCGT CGGGGTGGAC
GCGATCAACA TCCCCGACAG CCCACGTGCT TCCGCGCGCA TGAGCAACCA GGCTCTGTGC
CTGCTGACGC AACAGCAGGT CGGCATTGAG ACGGTGCTGC ACTACACCTG TCGCGACAGG
AACGTTCTGG GCATTCAGTC GGACTTGCTG GGCGCCAGCG CAATCGGGAT CCGTAATTTG
ATCTGTATCA CCGGCGATCC GCCGAAGATG GGCAACTATC CCGACGCTAC CGCCGTGTTT
GATGTGGATG CGATCGGCTT AGTGAACATC GTCAGCAACT TGAATCGCGG GCTGGATATC
GGCGGCAACA CCATCGGCAC CGCGACGCAG TTCACGATTG CGGTCGGGGC GAATCCCGGT
GCAGCAAACC TGGACGAGGA AATCCGCCGC TTCGAGTACA AAGTTGAAGC TGGAGCCGAG
TACGCTGTCA CCCAGCCGGT ATTCGACCTG GCGCTGCTGG AGGAGTTCCT GAAGCGCATT
GAGCACTGCC GCGTTCCGGT GGTTGCCGGG ATTTGGCCGC TGAGCAGCGC CCGTAACGCC
GAATTCATGA GGGACGAACT GAACATCTCC ATTCCTGACG CGATTTACAA CCGGATGGCC
CGCACCACCA ATGCGGACGC CGCCCGGGCA GAAGGTGTCG CGGTCGCCCG GGAAATGCTC
CAGGCAGTCC GGCCAATGGT GCAAGGTACA CAGCTGAGTG CCCCGTTTGG CCGGTATTCG
GCGGCAGCGG ACGTGCTGGA AGTTCTGGGA AGTTCCGACT CCGCCTGCGC GTAA
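
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the sequence could be cleaned and its GC content computed; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used here as sample input only.
first_line = "ATGGGGAAGT CATTTCTAGA GCGCCTTGGG GATGGGCCCA TCCTTTGCGA TGGCGCCATG"
seq = clean_sequence(first_line)

print(len(seq))                   # number of bases in the sample line
print(round(gc_percent(seq), 1))  # GC percentage of the sample line only
```

Running the same functions on the full 1854 bp sequence should give a value comparable to the GC content field in the record, assuming that field was computed over the gene sequence alone.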
 
Protein sequence
MGKSFLERLG DGPILCDGAM GTLLYSKGIF INRCYDELNL SQPELIGGIH ADYVANGAEI 
LETNTFGANS FRLARHGCQE KLADINRAGV ELVRKAIKNN QVYAAGAVGP LGIRIEPLGK
TSRDEARDAF RDQIRVLVDS GVDLLILETF GYLGELHQAI LAARDVDPKI PVVAQVTIDE
DGNCLDGSSP EHYGARLTEW GADVIGCNCS VGPVAMLDAL ERLRAATPKP LSAQPNAGIP
RSVEGRNIYL CSPEYMASYA RKFVGVGVTL VGGCCGTTPD HIRSMKSALR VGEARTTSFK
VKTEVEEHAV EPTPLGQRSR VGARIASGEF LTLVEIVPPK GVNAEKEVEG ARYLMSVGVD
AINIPDSPRA SARMSNQALC LLTQQQVGIE TVLHYTCRDR NVLGIQSDLL GASAIGIRNL
ICITGDPPKM GNYPDATAVF DVDAIGLVNI VSNLNRGLDI GGNTIGTATQ FTIAVGANPG
AANLDEEIRR FEYKVEAGAE YAVTQPVFDL ALLEEFLKRI EHCRVPVVAG IWPLSSARNA
EFMRDELNIS IPDAIYNRMA RTTNADAARA EGVAVAREML QAVRPMVQGT QLSAPFGRYS
AAADVLEVLG SSDSACA
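
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as starts, which does not matter here because this gene begins with ATG. The function name `translate` is hypothetical, and only short fragments of the gene sequence are used as input.

```python
BASES = "TCAG"

# Amino acids for all 64 codons, ordered by first, second, then third base
# following the TCAG order above; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGGGAAGT CATTTCTAGA GCGCCTTGGG"))  # prints: MGKSFLERLG
```

The output matches the first block of the protein sequence above. A full check would pass the entire gene sequence and compare the result with all 617 residues.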