Gene RPD_1788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1788 
Symbol 
ID4022270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2002191 
End bp2003792 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content69% 
IMG OID637961982 
Product4-diphosphocytidyl-2C-methyl-D-erythritol synthase 
Protein accessionYP_568925 
Protein GI91976266 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0303] Molybdopterin biosynthesis enzyme
[COG2068] Uncharacterized MobA-related protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.148843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.17866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTCG GCCCGCGACG CCCGGCGGAT GCGATCGGCG GCGTCACCGT GCATTCGCTG 
CGGCAGAACG GATTGCTGCT GAAGAAGGGC ACCTCAATCG GTCCAGCCGA AGTAGCGGCG
CTGGAGCACG CTGGTGTCGC CGAGATCGTC GTGGTGCAGC TCGAGCCGGG TGACGTCTCC
GAGGATGTCG CCGCCGCTGA TGTGGCGCAG GCGGTCGCCG GCGACGGCGT CAGTGTCGAG
CGCGCCTTCA CCGGCCGCGC CAATCTCTTT GCAAAGCGTC CGGGCGTGCT GGTGGTCGAG
CGTGCCGTGG TGGATCGTGT CAACGCCATC GACGAGGCGA TCACCTTCGC GACGCTTGCC
GCGTTCAAGC CGGTGGTCGA AGGCGAGATG ATCGCGACCG TCAAGCTGAT CCCGTTCGGC
GTCGAAGGAA AGCTCCGCGA CGCCGCGGTG CTGGCCGCGC AAGGCGGCGC GTTGCGTGTC
GCGCCCTACA TGGTCAAGCG CGTCGGCGTA GTGTCGACCC AGTTGCCGGG CCTCGCGCCG
AAAGTGATCG ACAAGACGCT GCGCGTCACA GCGGAGCGGC TGGCGCCGGC CGGTGCCGTG
ATCATCGCCG AGCGCCGCAT CGCCCATGAT GAAGCCGCGT TGTCGGCGGC GCTGAAGGAG
CTGCTCGGGC TCGGCGCCGA ACTGGTGGTC GTGTTCGGCG CCTCCGCGAT CGCCGATCGG
CGGGACGTGA TCCCGGCCGC GATCGGCGCC ATCGGCGGCA CGATCGAGCA TTTCGGGATG
CCGGTCGACC CCGGCAATCT GCTGCTGATC GGCAGCGCGT CGGGCGTGCC GGTGCTCGGG
GCGCCGGGTT GCGCGCGCTC GCCGGTCGAG AACGGCTTCG ACTGGGTGCT GATGCGGCTG
CTCGCGGGGC TTCAGGTGAC GCGCGCGGAC ATCACCGGCA TGGGCGTCGG CGGCCTGTTG
ATGGAAATCG TGACGCGGCC GCAGCCGCGC CTGCCGCTGA CCGAAGGCGG TCGCAACGTC
GCCGCGATCG TGCTCGCCGC CGGCCGCTCG ACCCGGATGG GCGGACCGAA CAAACTGCTC
GCGGAGCTGA ACGGCACGCC GTTGGTGCGG ATCGTCGCCG AACAGGTGAT GGCGTCGAAG
GCGTCGCGCG CGATCGTCGT CACTGGGCAC CAGGCCGACA AGGTCGAGGC GGCGCTGTCC
GGCCTCGACG TCTCGTTCGT GCACAACCCC GCCTTTGCCG AAGGCCTGGC CTCGTCGGTG
AAGGCCGGCA TCGCCGCGGT CGCCGACGAT GCGGATGGCG CGGTGGTCTG TCTCGGCGAT
ATGCCGCTGA TCGATTCCCT CTTGATCGAC CGGCTGATCG GTGCGTTCGA TCCGGATCGC
GGCGGGCTGA TCGTGGTGCC GGTCGCGGAT GGTCGGCGCG GCAATCCGGT GTTGTGGTCG
CGTCGCTTCT TCAGCGAGCT GATGACGCTC GACGGCGACA TCGGCGCGCG CCATCTGATC
GCCAAACACG GCGAGGCCGT GACCGAAGTG CCAGTCGACG GCCACGCTGC GTTTCTCGAC
ATCGACACGC CGCAGGCGCT GGAGGAAGCT CGGCGCGGCT AG
 
Protein sequence
MRFGPRRPAD AIGGVTVHSL RQNGLLLKKG TSIGPAEVAA LEHAGVAEIV VVQLEPGDVS 
EDVAAADVAQ AVAGDGVSVE RAFTGRANLF AKRPGVLVVE RAVVDRVNAI DEAITFATLA
AFKPVVEGEM IATVKLIPFG VEGKLRDAAV LAAQGGALRV APYMVKRVGV VSTQLPGLAP
KVIDKTLRVT AERLAPAGAV IIAERRIAHD EAALSAALKE LLGLGAELVV VFGASAIADR
RDVIPAAIGA IGGTIEHFGM PVDPGNLLLI GSASGVPVLG APGCARSPVE NGFDWVLMRL
LAGLQVTRAD ITGMGVGGLL MEIVTRPQPR LPLTEGGRNV AAIVLAAGRS TRMGGPNKLL
AELNGTPLVR IVAEQVMASK ASRAIVVTGH QADKVEAALS GLDVSFVHNP AFAEGLASSV
KAGIAAVADD ADGAVVCLGD MPLIDSLLID RLIGAFDPDR GGLIVVPVAD GRRGNPVLWS
RRFFSELMTL DGDIGARHLI AKHGEAVTEV PVDGHAAFLD IDTPQALEEA RRG