Gene Rpal_4320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4320 
Symbol 
ID6412004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4647917 
End bp4649518 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content70% 
IMG OID642714202 
Product4-diphosphocytidyl-2C-methyl-D-erythritol synthase 
Protein accessionYP_001993291 
Protein GI192292686 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0303] Molybdopterin biosynthesis enzyme
[COG2068] Uncharacterized MobA-related protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTCG GCCCGCGCCG TCCGGCCGAT GCGATCGGCG GCGTCACCGT GCATTCGCTG 
CGGCAGAACG GTCTGCTGCT GAAGAAGGGC ACGACGATCG GCCCGGCTGA AGTTGCCGCG
CTGGAGAAGG CCGGCATCGC CGAGATCGTG GTGGTCCAGC TCGAACCCGG TGACGTGTCC
GAGGATGTCG CGGCTGCGGA TGTCGCGGGC GCGGTCGCCG GCGATGGCGT CACGGTGGAT
CGCGCGTTCA CCGGCCGCGC CAATCTGTTC GCGGGACGGA CCGGCGTCTT GGTGATCGAT
CGCGCGGTGG TCGATCGCAT CAATGCGGTC GACGAGGCGA TCACTTTGGC GACCCTGCCG
GCGTTCAAGC CGGTCGTCGA AGGCGAGATG GTCGCCACCG TCAAGCTGAT CCCGTTCGGC
GTCGCGAGCC GGCTGCGCGA CGCCGCGGTG GCGGCAGCGG AGGGCAGCGC ACTACGCGTG
GCGCCTTACG TGATCAAGCG CGTCGGCGTG GTGTCGACGC TGCTGCCCGG ACTCGCCCCT
AAGGTGGTCG ACAAGACCCT CCGCGTCACC GCCGAGCGGC TGGCGCCGGC CGGTGCCGAG
ATTATCGCCG AGCGCCGCGT CGCGCATGAA GAGGCCGCGC TGTCGGCCGC GATCAAGGAG
CTACTTGGCC TCGGCGCCGA GATGGTGATC GTGTTCGGCG CCTCGGCGAT CGCCGACCGC
CGCGACGTGA TCCCGGCGGC GATCGGGACC GTCGGCGGCA CGATTTCGCA TTTCGGCATG
CCGGTCGACC CGGGCAATCT CTTGCTGGTC GGCTCGATAT CCGGTGTGCC GGTGCTGGGC
GCGCCGGGCT GCGCGCGCTC GCCGGTCGAG AATGGTTTCG ACTGGGTGCT GATGCGGCTG
CTCGCCGGTC TCACCGTCAC CCGAGCCGAC ATCACCGCGA TGGGTGTCGG CGGCCTGTTG
ATGGAGATCG TGACGCGGCC GCAGCCGCGC GTCCCGGTCG CGGAAAGCGG CCGCAACGTC
GCGGCGATCG TGCTCGCCGC TGGGCGTGGC ACACGGATGG GCGGGCCGAA CAAACTGCTC
GCCGACCTCA ACGGCAAGCC GCTGGTGCGG ATCGTCGCCG AGCAGGCGTT GGCTTCGCAG
GCCGCACGCA CCATCATCGT CACCGGCCAT CAGGCCGGCG AGGTCGAAGC GGCGCTGCAC
GGGCTCGATC TCACCTTCGT GCACAATCCG GACTTCGCCC AAGGCATCGC GTCGTCGGTG
AAGACCGGCA TCGCGGCGCT CGGCGAGGAT GCCGACGGCG CGATCGTCTG TCTCGGCGAC
ATGCCGCTGA TCGACGCCGC GCTGATCGAT CGGATGATCA CAGCGTTCGA TCCCGATCGC
GGCGCGCTGA TCGTAGTGCC GGTCGCAGAG GGGCGCCGCG GCAATCCGGT GCTGTGGTCG
CGGCGGTTCT TTGCCGAGCT GATGACGCTC GGCGGCGACG TCGGCGCGCG GCATCTGATC
GCCAAGCATG GCGAAGCGGT GACTGAAGTG CCGGTCGAAG GTGAGGCGGC ATTCCTCGAC
ATCGACACCC CGCAAGCGCT CGATGAGGCG CGGCGCGGCT AA
 
Protein sequence
MKFGPRRPAD AIGGVTVHSL RQNGLLLKKG TTIGPAEVAA LEKAGIAEIV VVQLEPGDVS 
EDVAAADVAG AVAGDGVTVD RAFTGRANLF AGRTGVLVID RAVVDRINAV DEAITLATLP
AFKPVVEGEM VATVKLIPFG VASRLRDAAV AAAEGSALRV APYVIKRVGV VSTLLPGLAP
KVVDKTLRVT AERLAPAGAE IIAERRVAHE EAALSAAIKE LLGLGAEMVI VFGASAIADR
RDVIPAAIGT VGGTISHFGM PVDPGNLLLV GSISGVPVLG APGCARSPVE NGFDWVLMRL
LAGLTVTRAD ITAMGVGGLL MEIVTRPQPR VPVAESGRNV AAIVLAAGRG TRMGGPNKLL
ADLNGKPLVR IVAEQALASQ AARTIIVTGH QAGEVEAALH GLDLTFVHNP DFAQGIASSV
KTGIAALGED ADGAIVCLGD MPLIDAALID RMITAFDPDR GALIVVPVAE GRRGNPVLWS
RRFFAELMTL GGDVGARHLI AKHGEAVTEV PVEGEAAFLD IDTPQALDEA RRG