Gene Rsph17029_2997 details


Gene Information

Locus tag: Rsph17029_2997
Symbol:
ID: 4898354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 5780
End bp: 8506
Gene Length: 2727 bp
Protein Length: 908 aa
Translation table: 11
GC content: 69%
IMG OID: 640113599
Product: methionine synthase
Protein accession: YP_001044870
Protein GI: 126463757
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1410] Methionine synthase I, cobalamin-binding domain
TIGRFAM ID: [TIGR00640] methylmalonyl-CoA mutase C-terminal domain
            [TIGR02082] 5-methyltetrahydrofolate--homocysteine methyltransferase
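
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5780
end_bp = 8506
gene_length_bp = 2727
protein_length_aa = 908

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under those assumptions all three checks agree with the values listed in the record.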


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.199846
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGTT ACCTTCGTCT CTCGGGCCTC GAGCCCTTCG TCCTCACGCC CGACATCCCC 
TTCGTCAACG TGGGCGAGCG GACGAACGTC ACGGGCTCGG CGCGCTTCCG CAAGCTCATC
ACCAACCGCG ACTATGCGGC GGCGCTCGAG GTCGCGCGCG ATCAGGTGCA GAACGGGGCG
CAGATCCTCG ACGTGAACAT GGACGAGGGG CTGATCGACT CGAAGGCCGC CATGGTCGAA
TTCCTCAACC TGATCGCGTC CGAGCCCGAC ATCGCGCGGG TGCCGCTGAT GATCGACAGC
TCGAAATGGG AGGTGATCGA GGCCGGGCTG CAATGCGTGC AGGGCAAGCC CGTGGTCAAT
TCGATCAGCC TGAAGGAGGG CGAGGAGTCG TTCCGGCGGC AGGCGGGCCT CTGCCTCGCC
TACGGGGCCG CGGTCGTGGT CATGGCCTTC GACGAGGAGG GGCAGGCCGA CAGTTTCGCC
CGCAAGACCA CGATCTGCGC CCGGGCCTAC CGCATCCTCG TCGAGGAGGT GGGCTTCCCG
CCCGAGGACA TCATCTTCGA CCCGAACGTC TTCGCCGTGG CCACGGGCAT CGAGGAGCAC
GACAATTACG GCGTCGACTT CATCGAGGCC GCGCGCTGGA TCCGGGCGAA ACTGCCCCAT
GCCCATGTCT CGGGCGGCGT GTCGAACCTC TCCTTCAGCT TCCGCGGCAA CGAGCCCGTG
CGCGAGGCGA TGCATGCGGT CTTCCTCTAC CATGCGATCC GGGCCGGGAT GGACATGGGG
ATCGTGAATG CGGGCCAGCT TGCGGTCTAC GACCAGATCG ATCCCGACCT GCGCGAGGCC
TGCGAGGATG TGGTGCTGAA CCGCAAGCCT AAGCAGGGCG GCACCGCGAC CGAGCGGCTT
TTGGCCGTGG CCGAGCGGTT CCGGGGCGGC GCGCGCGAGG AGAAGACGCG CGATCTGGCC
TGGCGCGGCT GGCCGGTCGA GAAGCGGCTC GAACATGCGC TGGTCAACGG CATCACCGAA
TTCATCGAGG CCGACACCGA AGAGGCCCGG CAGGTGGCCG AGCGGCCGCT CCATGTGATC
GAGGGCCCGC TGATGGCGGG GATGAACGTG GTGGGCGACC TCTTCGGCGC GGGAAAGATG
TTCCTGCCGC AGGTGGTGAA GTCGGCCCGG GTGATGAAGC AGGCCGTGGC CGTGCTCCTG
CCCTACATGG AGGAGGAGAA GCGTCTCGGC GGCGGCGAGG GCCGCGAGGC GGCGGGCAAG
GTGCTCATGG CCACGGTGAA GGGCGATGTG CATGACATCG GCAAGAACAT CGTGGGCGTG
GTTCTGGCCT GCAACAATTA CGAGATCATC GATCTGGGCG TGATGGTGCC GCCGGCGAAG
ATCCTCGAGG TGGCGCGGGC CGAGAAGGTG GATGCGATCG GCCTCTCGGG CCTCATCACG
CCCTCGCTCG ACGAGATGGT GACGGTGGCG GCCGAGATGG AACGCGAAGG GTTCGACATC
CCGCTGCTGA TCGGCGGGGC CACTACCTCG AAGGTGCATA CGGCGGTGAA GATCGCGCCC
TCCTACTGCC GGGGCCCCGC GGTCTATGTG ACCGATGCAA GCCGCGCGGT GGGGGTGGTG
GGCCAGCTTC TGAGCGCCGA GCGCAAGGGC GCCTATGTCG AGGGGCTGCG CGCCGACTAT
GCCGAGGTGG CCGAGCGCCA CGCCCGGTCG GAACGCGCCA AGCAGCGCCT GCCGCTGGCG
GCGGCGCGGG CCAATGCGCT GAAGCTCGAC TGGGCCGCGC ACCGGCCGGT GCGGCCGAGC
TTCACCGGCA GTCGGACCGT CGACGGCTGG GATCTGGCCG AGATCGCGCG CTACATCGAC
TGGACGATGT TCTTCCAGAC CTGGGAGTTG AAGGGGGTCT ATCCGCGCAT CCTCGAGGAT
CCGGCGCAGG GCGAGGCCGC CCGCGCGCTC TTTGCCGATG CGCAGGAGAT GCTCGCGCGC
ATCGTGGCGG AACGCTGGTT CACGCCGCGT GCGGTGGTGG GCTTCTGGCC CGCCAATGCG
GTGGGCGACG ACATCCGCCT CTATGCCGAC GAGGCGCGAC GGGCCGAGCT TGCCACCTTC
TTCACGCTCC GCCAGCAGGT CACGAAGCGC GAAGGGCGGC CCAATCTGGC CCTGTCGGAT
TTCGTGGCGC CCGAAGGGGC GGGCCCCGAC TGGGTCGGGG GGTTCGTGGT GACGGCCGGC
CCCGAGGAGG CTTCCATCGC CGAGCGGTTC GACCGGGCGA ACGACAATTA TTCCGCCATC
ATGGTCAAGG CGCTGGCCGA CCGCTTCGCC GAGGCCATGG CCGAGAGGCT GCACGAGAGG
GTGCGCCGCG AGCTCTGGGG CTACGCCCCT GACGAGTCCT TCACGCCCGA TGCGCTTCAT
GCCGAGCCCT ACCGCGGCAT CCGGCCCGCA CCCGGCTACC CGGCCCAGCC CGATCATACC
GAGAAGGTCA CGCTCTTCCG CCTGCTCGAT GCGACCGCGG CCACCGGGGT GGAGCTGACC
GAGAGCATGG CCATGTGGCC GGGCTCGTCG GTCTCGGGCC TCTATATCGG CCATCCCGAC
GCCTATTACT TCGGCCTCGC CCGGGTCGAG CGCGATCAGG CCGAGGATTA TGCCCGCCGC
AAGGGCATGG ATCTGGCCGA GGTCGAGCGC TGGCTCGCCC CGGTCATGGC TGGCCGGGTC
GAGACGCCCT CTTCGCGGGC CGCCTGA
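
The GC content field of this record is a fraction of G and C bases over the sequence length. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the example call uses only the first line of the gene sequence, so its result is not expected to equal the value reported for the whole gene.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First 60 bases of the gene sequence, with the grouping spaces left in place.
first_line = (
    "ATGACCCGTT ACCTTCGTCT CTCGGGCCTC "
    "GAGCCCTTCG TCCTCACGCC CGACATCCCC"
)
print(f"GC content of the first 60 bases: {gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` would give the figure to compare against the GC content field.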
 
Protein sequence
MTRYLRLSGL EPFVLTPDIP FVNVGERTNV TGSARFRKLI TNRDYAAALE VARDQVQNGA 
QILDVNMDEG LIDSKAAMVE FLNLIASEPD IARVPLMIDS SKWEVIEAGL QCVQGKPVVN
SISLKEGEES FRRQAGLCLA YGAAVVVMAF DEEGQADSFA RKTTICARAY RILVEEVGFP
PEDIIFDPNV FAVATGIEEH DNYGVDFIEA ARWIRAKLPH AHVSGGVSNL SFSFRGNEPV
REAMHAVFLY HAIRAGMDMG IVNAGQLAVY DQIDPDLREA CEDVVLNRKP KQGGTATERL
LAVAERFRGG AREEKTRDLA WRGWPVEKRL EHALVNGITE FIEADTEEAR QVAERPLHVI
EGPLMAGMNV VGDLFGAGKM FLPQVVKSAR VMKQAVAVLL PYMEEEKRLG GGEGREAAGK
VLMATVKGDV HDIGKNIVGV VLACNNYEII DLGVMVPPAK ILEVARAEKV DAIGLSGLIT
PSLDEMVTVA AEMEREGFDI PLLIGGATTS KVHTAVKIAP SYCRGPAVYV TDASRAVGVV
GQLLSAERKG AYVEGLRADY AEVAERHARS ERAKQRLPLA AARANALKLD WAAHRPVRPS
FTGSRTVDGW DLAEIARYID WTMFFQTWEL KGVYPRILED PAQGEAARAL FADAQEMLAR
IVAERWFTPR AVVGFWPANA VGDDIRLYAD EARRAELATF FTLRQQVTKR EGRPNLALSD
FVAPEGAGPD WVGGFVVTAG PEEASIAERF DRANDNYSAI MVKALADRFA EAMAERLHER
VRRELWGYAP DESFTPDALH AEPYRGIRPA PGYPAQPDHT EKVTLFRLLD ATAATGVELT
ESMAMWPGSS VSGLYIGHPD AYYFGLARVE RDQAEDYARR KGMDLAEVER WLAPVMAGRV
ETPSSRAA
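
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a hedged illustration, the sketch below translates the first 30 and last 27 bases of the gene. The helper `translate` is hypothetical. The codon table is the standard one, which is assumed to be sufficient here because translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in this record.
head = "ATGACCCGTT ACCTTCGTCT CTCGGGCCTC"
tail = "GAGACGCCCT CTTCGCGGGC CGCCTGA"

print(translate(head))  # MTRYLRLSGL
print(translate(tail))  # ETPSSRAA
```

The two results match the first ten and last eight residues of the protein sequence listed in the record. The tail is in frame because the 45 preceding sequence lines hold 60 bases each, a multiple of three.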