Gene RPB_3672 details


Gene Information

Locus tag: RPB_3672
Symbol:
ID: 3911474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4212307
End bp: 4213908
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 69%
IMG OID: 637885574
Product: 4-diphosphocytidyl-2C-methyl-D-erythritol synthase
Protein accession: YP_487278
Protein GI: 86750782
COG category: [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
        [COG2068] Uncharacterized MobA-related protein
TIGRFAM ID:
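
The start, end, gene length and protein length listed above are mutually consistent, and the relationship can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; both are assumptions, chosen because they reproduce the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length_bp(4212307, 4213908)
print(length)                     # 1602
print(protein_length_aa(length))  # 533
```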


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTCG GGCCGCGCCG TCCGGCCGAT GCGATCGGCG GCGTCACCGT GCATTCGCTG 
CGGCAGAACG GATTGCTGCT GAAGAAGGGC ACCGCGATCG GTCCGGCCGA AGTCGACGCG
CTGGAGCGCG CCGGCGTCGG TGAAATTGTC GTCGTTCAAC TCGAGTCGGG TGATGTCTCG
GAGGACGTCG CAGCAGCGGA CGTGGCGCAG GCCGTCGCCG GCGACGGTGC CAGCGTCGAG
CGCGCCTTCA CCGGCCGCGC CAATCTGTTC GCGCAGCGGC CCGGCGTGCT GGTGGTTGAT
CGCGCCGCGG TCGATCGGGT CAATGCGGTC GACGAGGCGA TCACCTTCGC GACGCTGCCG
GCGTTCAAGC CGGTGGTCGA AGGCGAGATG ATCGCGACCG TCAAGCTGAT CCCGTTCGGC
GTCGAGGGGA GACTGCGCGA CGCCGCGGTG GCGGCTGCAC GAGGCTCCGC GCTGCAGGTC
GCGCCCTATG TCATCAAGCG TGTCGGCATC GTGTCGACGC AACTGCCCGG CCTCGCGTCC
AAGGTGATCG ACAAGACGCT GCGCGTCACC GCCGAGCGGC TGGCGCCGGC GGGTGCCGAG
ATCATCGCCG AGCGCCGCAT CGCTCATGAC GAATCTGCGC TCGCAACGGC GCTGCAGGAA
TTGCTCGGCC TCGGCGCCGA GCTGGTGATC GTGTTCGGCG CCTCGGCGAT CGCAGACCGC
CGCGACGTCA TCCCGGCGGC GATCGGCGCC ATCGGCGGGC AGGTCGAGCA CTTCGGTATG
CCGGTCGATC CCGGCAATCT GCTGCTGATC GGCAGCGCGT CGGGCGTCCC GGTGCTGGGT
GCGCCGGGCT GTGCGCGCTC GCCGGTCGAG AACGGCTTCG ACTGGGTGCT GATGCGGCTG
CTGGCGGGAT TGCCCGTGAC GCGCGCCGAT ATCACCGGCA TGGGTGTCGG CGGGTTGCTG
ATGGAGATCG TGACCCGACC GCAGCCGCGC GTGCCGGTAG CCGAAGGTGG CCGCAATGTC
GCGGCGATCG TGCTCGCCGC CGGCCGCTCG ACCCGGATGG GCGGGCCGAA CAAGCTGCTC
GCCGAACTGA ACGGCACGCC GCTGGTGCGG ATCGTGACCG AGCAGGTATT GGCGTCGAAG
GCATCGCGCG CGGTCGTGGT CACCGGGCAT CAGGCCGACA AGGTCGAGGC GGCGCTGTCC
GGGCTCGATG TGTCGTTCGT CCATAACCCG GCGTTCGCCG AAGGGCTGGC GTCGTCGGTC
AAAGCCGGTA TCGCCGCTGT GCCGGACGAT GCCGATGGCG CGATTGTTTG TCTCGGCGAC
ATGCCGCTGA TCGATTCCGA ACTGATCGAC CGGCTGATCG ACGCGTTCGA TCCGGATCGC
GGCGGGCTGA TCGTGGTGCC GGTCGCAGAT GGCCGCCGCG GCAATCCAGT GCTGTGGTCG
CGGCGGTTCT TCGCCGAGCT GATGACGCTC GACGGCGACA TCGGCGCGCG CCACCTGATC
GCCAAGCATG CCGAGGCGGT GACCGAAGTG CCGGTCGATG GCCACGCTGC GTTTCTCGAT
ATCGATACGC CGCAGGCGCT CGAGGATGCC CGCCGGGGCT GA
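
The gene sequence is printed in blocks of 10 bases, 60 bases per line, so the whitespace has to be removed before the sequence is measured or analysed. The following is a minimal sketch of that cleanup and of a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs on the first line of the listing only, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGAAGTTCG GGCCGCGCCG TCCGGCCGAT GCGATCGGCG GCGTCACCGT GCATTCGCTG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this 60-base line only
```

Applying the same two functions to the complete listing gives the full gene length and its overall GC content for comparison with the table values.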
 
Protein sequence
MKFGPRRPAD AIGGVTVHSL RQNGLLLKKG TAIGPAEVDA LERAGVGEIV VVQLESGDVS 
EDVAAADVAQ AVAGDGASVE RAFTGRANLF AQRPGVLVVD RAAVDRVNAV DEAITFATLP
AFKPVVEGEM IATVKLIPFG VEGRLRDAAV AAARGSALQV APYVIKRVGI VSTQLPGLAS
KVIDKTLRVT AERLAPAGAE IIAERRIAHD ESALATALQE LLGLGAELVI VFGASAIADR
RDVIPAAIGA IGGQVEHFGM PVDPGNLLLI GSASGVPVLG APGCARSPVE NGFDWVLMRL
LAGLPVTRAD ITGMGVGGLL MEIVTRPQPR VPVAEGGRNV AAIVLAAGRS TRMGGPNKLL
AELNGTPLVR IVTEQVLASK ASRAVVVTGH QADKVEAALS GLDVSFVHNP AFAEGLASSV
KAGIAAVPDD ADGAIVCLGD MPLIDSELID RLIDAFDPDR GGLIVVPVAD GRRGNPVLWS
RRFFAELMTL DGDIGARHLI AKHAEAVTEV PVDGHAAFLD IDTPQALEDA RRG
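
The protein sequence can be spot-checked against the gene sequence by translating codons. The sketch below builds the codon table from the standard NCBI ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, so the table is sufficient here. The `translate` helper is illustrative, and only the first 30 and last 42 bases of the gene are checked.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 42 bases of the gene sequence, copied as printed.
first_30 = "ATGAAGTTCG GGCCGCGCCG TCCGGCCGAT"
last_42 = "ATCGATACGC CGCAGGCGCT CGAGGATGCC CGCCGGGGCT GA"

print(translate(first_30))  # MKFGPRRPAD
print(translate(last_42))   # IDTPQALEDARRG*
```

Both results match the start and end of the protein sequence, with the trailing `*` corresponding to the TGA stop codon that is not part of the 533-residue protein.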