Gene Caul_3402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3402 
Symbol 
ID5900857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3674632 
End bp3677301 
Gene Length2670 bp 
Protein Length889 aa 
Translation table11 
GC content66% 
IMG OID641563908 
Productmethionine synthase 
Protein accessionYP_001685027 
Protein GI167647364 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1410] Methionine synthase I, cobalamin-binding domain 
TIGRFAM ID[TIGR00640] methylmalonyl-CoA mutase C-terminal domain
[TIGR02082] 5-methyltetrahydrofolate--homocysteine methyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.131568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0280549 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACCTG TCTTCGTCAA TATCGGTGAG CGCACCAACG TCACCGGCTC CGCCAAGTTC 
AAGAAGCTGA TCGTCGAGGG CGACTATCCC GCCGCCCTGT CCGTCGCCCG CCAGCAGGTC
GAGGCCGGCG CCCAGGTCAT CGACGTCAAC ATGGACGAGG GTCTGCTGGA CTCCAAGCAG
GCCATGGTCA CCTTCCTGAA CCTGATGGCG GCCGAGCCCG ACATCGCCCG GGTGCCGGTG
ATGATCGACA GCTCCAAGTG GGAGGTGATC GAGGCCGGCC TGAAGTGCGT GCAGGGCAAG
GCCATCGTCA ACTCGATCTC GATGAAGGAA GGCGAGGCCA AGTTCATCGA GCAGGCCAAG
CTGTGCCTGC GCTACGGCGC GGCCGTGGTG GTCATGGCCT TCGACGAGCA GGGCCAGGCC
GACACCGCCG CCCGCAAGAT CGAGATCTGC GAGAAGGCCT ATCGCATCCT GGTCGACAAG
GTGAGCTTCC CGCCGGAAGA CATCATCTTC GACCCCAACA TCTTCGCCGT GGCGACCGGG
ATCGAAGAGC ACGACAACTA CGCCGTCGAC TTCATCGAGG GCGCGCGCGA GATCAAGAAG
CGCTGCCCCT ACGCCCGGAT CAGCGGCGGG GTGTCGAACG TCTCGTTCAG CTTCCGCGGC
AACGAGCCCG TGCGCCGGGC GATCCACTCG GTGTTCCTGT ACCACGCCAT CGCGGCGGGC
ATGGACATGG GCATCGTCAA CGCCGGCGAC CTGCCGGTCT ATGACGCCCT GGACCCCGAG
TTGCGCGAGG CCGTCGAGGA CGTGATCCTC AATCGCCCGC AGCGAACCAA CGTCACCAAC
ACCGAGCGCC TGGTCGACAT GGCCCCGCGC TACAAGGGCG ACAAGAGCCA GGTCCAGACC
GCCAATCTGG AATGGCGCAA GGGCAGCGTG AACGAGCGCA TCACCCACGC TCTGGTCAAC
GGCATCACCG AATTCATCAA CGAGGACACC GAGGAAGCCC GCCTGTCGGT CGAACGGCCG
CTGCACGTCA TCGAAGGCCA TCTGATGGAC GGCATGAACG TGGTCGGCGA CCTGTTCGGC
TCGGGCAAGA TGTTCCTGCC CCAGGTGGTC AAGTCGGCCC GGGTGATGAA ACAGGCCGTG
GCCTGGCTCG AACCCTTCAT GGAGGCTGAG AAGGCCGGCA AGCCGCGCGA GCAGGCCGGC
CGCATCCTGA TGGCCACCGT CAAGGGCGAC GTCCACGACA TCGGCAAGAA CATCGTCGGC
GTCGTGCTCC AGTGCAACAA CTACGAGGTC ATCGACCTGG GCGTGATGGT GCCGGCCGAC
CGCATCCTGG ACGAGGCGCG CAAGCACAAC GTCGACATGA TCGGCCTGTC GGGCCTGATC
ACCCCGTCGC TGGACGAGAT GGTGTTCGTG GCCTCAGAGA TGGAGCGCCA GGGCTTCACC
ATGCCGCTGC TGATCGGCGG CGCCACCACC AGCCGCACCC ACACCGCCGT CAAGATCGAG
CCGGCCTATC ACGCTGGCTC GACGACCTAT GTGCTGGACG CCAGCCGCGC GGTGGGCGTG
GTCTCGGGCC TGCTGTCGGC CAGCGAGCGC GATCGTCTGC AGGCCGAGAC GCGGGCCGAA
TATGTCCGCA TCCGCGAGCA ATATGCCCGG GGCCAGACGG CCAAGGCGCG CACCAAGATC
AGCGACGCCC GCCAGCGCAA GTTCGCCATC GACTGGGAGG GCTATGCGCC GCCCAAGCCC
AGCTTCATCG GCGCGCGCAC CTTCGAGCCG TCGCTGGAAG AGCTGGTCCC GTTCATCGAC
TGGTCGCCGT TCTTCGCCAG CTGGGAGCTG ATCGGCCGCT TCCCGCAGAT CCTCGAGGAC
GACGTGGTCG GCGAGGCCGC CACTGACCTC TATCGCGACG CCCGCGAGAT GCTCGACAAG
GTCGTGGCCG AGAAGTGGTT CGGGGCCAAG GGCGTGGTCG GCTTCTGGCC GGCCCAGGCC
GACGGCGACG ACATCGTTCT CTACACGGAC GAAACCCGCA CGACCGAGCT ATCGCGGCTG
TTCACCCTGC GCCAGCAGAT GGACAAGTCC GAGGGCAAGG CCAACCTGGC GCTGTCGGAC
TTCGTCGCGC CGATCGGGCA GGGGGCCGAC TACATGGGCG GCTTCGCCGT CACCGCCGGC
CATGGCGAGG ACGAGATCGT CAAGCGCTTC AAGGACGCCG GCGACGACTA CAGCGCCATC
ATGGCCTCGG CCCTGGCCGA CCGCCTGGCC GAAGCCTTCG CCGAATGGCT GCACTACAGG
GTCCGCGTCG AGCTCTGGGG CTATGCGCCG GGCGAACTGC GCGACACCGA CCTGATGATC
GCCGAGAAGT ACCAGGGCAT CCGCCCGGCC CCCGGCTATC CGGCCCAGCC CGACCACACC
GAGAAGGGCA CGCTGTTCAA GCTGCTGGAC GCCGAAGCCG CCACCGGCAT GATCCTGACC
GAGAGCTACG CCATGAGCCC CGGCGCGGCG GTCTCCGGTT TCTATTTCAG CCACCCGCAG
AGCCACTATT TCGGCGTCGG CAAGGTCGAT CTCGACCAGG TCGAGGACTA TGCCCGCCGT
AAGGGCTGGG ACCTGGCCAA GGCCGAGAAA TGGCTGTCGC CGATCCTCAA CTACAACCCC
GGCGCCAAGG CGCGGGGGGA GGCGGCGTAG
 
Protein sequence
MRPVFVNIGE RTNVTGSAKF KKLIVEGDYP AALSVARQQV EAGAQVIDVN MDEGLLDSKQ 
AMVTFLNLMA AEPDIARVPV MIDSSKWEVI EAGLKCVQGK AIVNSISMKE GEAKFIEQAK
LCLRYGAAVV VMAFDEQGQA DTAARKIEIC EKAYRILVDK VSFPPEDIIF DPNIFAVATG
IEEHDNYAVD FIEGAREIKK RCPYARISGG VSNVSFSFRG NEPVRRAIHS VFLYHAIAAG
MDMGIVNAGD LPVYDALDPE LREAVEDVIL NRPQRTNVTN TERLVDMAPR YKGDKSQVQT
ANLEWRKGSV NERITHALVN GITEFINEDT EEARLSVERP LHVIEGHLMD GMNVVGDLFG
SGKMFLPQVV KSARVMKQAV AWLEPFMEAE KAGKPREQAG RILMATVKGD VHDIGKNIVG
VVLQCNNYEV IDLGVMVPAD RILDEARKHN VDMIGLSGLI TPSLDEMVFV ASEMERQGFT
MPLLIGGATT SRTHTAVKIE PAYHAGSTTY VLDASRAVGV VSGLLSASER DRLQAETRAE
YVRIREQYAR GQTAKARTKI SDARQRKFAI DWEGYAPPKP SFIGARTFEP SLEELVPFID
WSPFFASWEL IGRFPQILED DVVGEAATDL YRDAREMLDK VVAEKWFGAK GVVGFWPAQA
DGDDIVLYTD ETRTTELSRL FTLRQQMDKS EGKANLALSD FVAPIGQGAD YMGGFAVTAG
HGEDEIVKRF KDAGDDYSAI MASALADRLA EAFAEWLHYR VRVELWGYAP GELRDTDLMI
AEKYQGIRPA PGYPAQPDHT EKGTLFKLLD AEAATGMILT ESYAMSPGAA VSGFYFSHPQ
SHYFGVGKVD LDQVEDYARR KGWDLAKAEK WLSPILNYNP GAKARGEAA