Gene Rsph17025_3658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3658 
Symbol 
ID5085518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp547921 
End bp550623 
Gene Length2703 bp 
Protein Length900 aa 
Translation table11 
GC content69% 
IMG OID640485216 
Producthypothetical protein 
Protein accessionYP_001169831 
Protein GI146279673 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1410] Methionine synthase I, cobalamin-binding domain 
TIGRFAM ID[TIGR00640] methylmalonyl-CoA mutase C-terminal domain
[TIGR02082] 5-methyltetrahydrofolate--homocysteine methyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0544622 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCC ACCTCCGCCT CTCGGGCCTC GAGCCCTTCG TGCTGACCCC CGACATTCCC 
TTCGTCAACA TCGGCGAGCG GACGAACGTC ACGGGCTCGG CGCGGTTCCG CAAGCTGATC
ACCAACCGCG ACTATGCGGC CGCCCTCGAG GTCGCGCGCG ATCAGGTGCA GAACGGCGCG
CAGATCCTCG ACGTGAACAT GGACGAGGGG CTGATCGACT CGAGGGCCGC CATGGTCGAG
TTCCTCAACC TGATCGCGGC CGAGCCCGAC ATCGCCCGCG TGCCGCTGAT GATCGACAGC
TCGAAATGGG AGGTGATCGA GGCGGGGCTG AAATGCGTGC AGGGCAAGCC CGTCGTCAAT
TCGATCAGCC TGAAGGAGGG CGAGGAGATC TTCCGCCAGC AGGCCGGGCT CTGCCTCGCC
TATGGCGCGG CGGTGGTCGT CATGGCCTTC GACGAAGAGG GACAGGCCGA CAGCTTCGCC
CGCAAGACCG GGATCTGCGC CCGCGCCTAC CGCATCCTCG TCGAGGAGAT CGGCTTCCCG
CCCGAGGACA TCATCTTCGA CCCGAACATC TTCGCCGTGG CCACCGGCAT CGAGGAGCAC
GACAACTACG GCGTCGATTT CATCGAGGCC ACCCGCTGGA TCCGGGCGAA CCTGACCCAT
GCCCATGTTT CGGGCGGCGT CTCGAACCTG TCCTTCAGCT TCCGCGGCAA CGAGCCCGTG
CGCGAGGCGA TGCATGCGGT CTTCCTCTAC CATGCCATCC GGGCAGGGAT GGACATGGGG
ATCGTCAACG CGGGCCAGCT GGCTGTGTAT GACCAGATCG ACCCAGAGCT GCGCGAGGCC
TGCGAGGATG TGGTCCTGAA CCGTCGTCGC GACGCGACCG AGCGGCTTCT GGCCGTTGCG
GACCGCTTCC GCGGCGGCGC CCGCGAGGAG AAGGTGCGCG ATCTGGCGTG GCGGGACTGG
CCGGTGGAAA AACGGCTGGA ACATGCGCTG GTGAACGGCA TCACCGAATT CATCGAGGCC
GACACCGAGG AGGCCCGGCT TGCGGCCGAG CGGCCGCTGC ATGTGATCGA GGGCCCGCTG
ATGGCGGGGA TGAACGTGGT GGGCGATCTC TTCGGCGCGG GCGAGATGTT CCTGCCGCAG
GTGGTGAAGT CGGCGCGTGT GATGAAGCAG GCGGTCGCGG TGCTTCTGCC CTACATGGAG
GCCGACAAGG GCGGTGCCCG CGAGGCGGCC GGCAAGGTGC TGCTGGCAAC CGTCAAGGGC
GACGTGCACG ACATCGGCAA GAACATCGTC GGCGTCGTTC TGGCCTGCAA CAACTACGAG
ATCATCGACC TCGGCGTCAT GGTGCCGCCG GCGAGGATCC TCGAGGTGGC GCGGGCCGAG
AAGGTGGATG TGATCGGCCT GTCGGGCCTC ATCACCCCGT CGCTGGACGA GATGGTGACG
CTCGCCGCAG AGATGGAGCG CGAGGGCTTC GGGGTTCCAC TGCTGATCGG GGGGGCCACC
ACCTCGCGCG TGCACACCGC CGTCCGGATC GCGCCCGCCT ATCACCGCGG CCCGGCGGTC
CATGTTGCCG ACGCCAGCCG GGCGGTGGGC GTGGTGAGCC AGTTGCTCAG CCCGACGCAA
AGGCAGGCCT ATGTCGAGGG GCTGCGCGCC GATTACGCGC AGGTGGCCGA GCGCCATGCG
CGGTCCGAGC GCGCCAAGCA GCGCCTCTCG CTCGCCGCCG CGCGGGCCAA TGCCCTCCGG
CTCGACTGGC CGTCCTATTC CGCCACCCCG CCCACCTTCA CCGGGGTCCG GGTGATCGAG
GACTGGGATC TGGCCGAGAT TGCGCGCTAC ATCGACTGGA CGATGTTCTT CCATGCATGG
GAGATGAAGG GGGTCTGGCC GCGCATCCTC GAGGATGAGG CGCAGGGCGA GGCGGCACGC
GCGCTCTTTG CCGACGCGCA GGCGATGCTC GCGCGGCTGG TGGCGGACCG CTGGTTCACG
CCCCGCGCGG TGGTGGGCTT CTGGCCGGCC AATGCGGTGG GCGACGACAT CCGCCTCTAT
GCCGACGAGA GCCGGCGCGA GACGCTCGCC ACCCTCCACA CGCTGCGCCA GCAGGTGCCC
AAGCGCGAGG GCCGGCCGAA CGTGGCGCTC GCGGACTTCG TGGCGCCCGA GGGGACGGCG
CCCGACTGGG TTGGCGGCTT TGTCGTGACA GCCGGGCCGG AGGAGGCCGC GATTGCCGAC
GGCTTTGACC GTGCGAACGA CAACTATTCC TCGATCATGG TCAAGGCGCT CGCCGATCGC
TTCGCCGAGG CGATGGCCGA GATGCTGCAC GAACGTGTCC GCCGAGACTA CTGGGGCTAC
GCGCCCAATG AGGCCTTCGC GCCCCAGGAC CTCCACGCCG AGCCCTACCG CGGCATCCGG
CCCGCCCCCG GCTATCCCGC CCAGCCGGAC CACACCGAGA AGGTCACGCT GTTCCGCCTG
CTCGATGCGA CAGCGGCAAC GGGGGTGGAG CTGACCGAGA GCATGGCGAT GTGGCCGGGC
TCGTCGGTCT CGGGCCTCTA CATCGCCCAC CCCGAGTCCT ACTACTTCGG CCTCGCACGG
ATCGAGCGCG ACCAGGCCGA GGATTACGCC CGCCGCAAGG GCATGCCGCT GGCCGAGGTC
GAGCGGTGGC TCGCCCCGGT GCTGGGCAGC CGCCGCGAGG CGACCTCTCT GGCGGCGGAG
TAG
 
Protein sequence
MSRHLRLSGL EPFVLTPDIP FVNIGERTNV TGSARFRKLI TNRDYAAALE VARDQVQNGA 
QILDVNMDEG LIDSRAAMVE FLNLIAAEPD IARVPLMIDS SKWEVIEAGL KCVQGKPVVN
SISLKEGEEI FRQQAGLCLA YGAAVVVMAF DEEGQADSFA RKTGICARAY RILVEEIGFP
PEDIIFDPNI FAVATGIEEH DNYGVDFIEA TRWIRANLTH AHVSGGVSNL SFSFRGNEPV
REAMHAVFLY HAIRAGMDMG IVNAGQLAVY DQIDPELREA CEDVVLNRRR DATERLLAVA
DRFRGGAREE KVRDLAWRDW PVEKRLEHAL VNGITEFIEA DTEEARLAAE RPLHVIEGPL
MAGMNVVGDL FGAGEMFLPQ VVKSARVMKQ AVAVLLPYME ADKGGAREAA GKVLLATVKG
DVHDIGKNIV GVVLACNNYE IIDLGVMVPP ARILEVARAE KVDVIGLSGL ITPSLDEMVT
LAAEMEREGF GVPLLIGGAT TSRVHTAVRI APAYHRGPAV HVADASRAVG VVSQLLSPTQ
RQAYVEGLRA DYAQVAERHA RSERAKQRLS LAAARANALR LDWPSYSATP PTFTGVRVIE
DWDLAEIARY IDWTMFFHAW EMKGVWPRIL EDEAQGEAAR ALFADAQAML ARLVADRWFT
PRAVVGFWPA NAVGDDIRLY ADESRRETLA TLHTLRQQVP KREGRPNVAL ADFVAPEGTA
PDWVGGFVVT AGPEEAAIAD GFDRANDNYS SIMVKALADR FAEAMAEMLH ERVRRDYWGY
APNEAFAPQD LHAEPYRGIR PAPGYPAQPD HTEKVTLFRL LDATAATGVE LTESMAMWPG
SSVSGLYIAH PESYYFGLAR IERDQAEDYA RRKGMPLAEV ERWLAPVLGS RREATSLAAE