Gene RPD_4089 details


Gene Information

Locus tag: RPD_4089
Symbol:
ID: 4024606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4544791
End bp: 4546098
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 69%
IMG OID: 637964292
Product: dihydrodipicolinate reductase
Protein accession: YP_571209
Protein GI: 91978550
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4091] Predicted homoserine dehydrogenase
TIGRFAM ID:
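
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names gene_length and protein_length are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the record for RPD_4089.
length_bp = gene_length(4544791, 4546098)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.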


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTTC ATCATCTGCT CGCCGCCCGC GCCGACGCCG GCCGGCCGGT TCGCGTCGCA 
CTGATCGGCG CCGGCAAATT CGGCTCGATG TTTCTCGCCC AGACGCCGCA CACCCGCGGG
CTCGAGGTGG CGGCGATCGT CGATCTCGAT CCCGAGCGCG CCCGTGACGC CTGCCGCCAT
GTCGGCTGGG ACGAAGGCAG GATCGCGGCG ACGCGCTTCG ACAGCGATCC GGCCGCGGCG
AATGCCGCCG GCATCGAGGT CGTGGTCGAA GCGACCGGCA ATCCCGCCGC CGGCATCCGC
CATGCGCGCG CGGCGATCGC CGCCGGACAG CATGTGGTGA TGGTCAATGT CGAGGCCGAC
GTGCTGGCGG GGCCGCTGCT CGCCGACGAG GCGCGGCGCG CCGGCGTGGT CTATTCGCTG
GCCTATGGCG ATCAGCCGGC GCTGACCGCC GAGCTGGTGG ACTGGGCGCG CGCGACCGGA
TTCCGCGTCG TCGCCGCCGG CAAGGGCACC AAATATCTGC CGATCTATCA CGACGTCACG
CCGGCCGGGG TGTGGAGCCA TTACGGTCTT TCCGCCGCCG AGGCGCAATC GGCCGGGATG
AATCCGCAGA TGTTCAACTC GTTTCTCGAC GGCACGAAAT CCGCGATCGA AATGGCGGCG
ATCGCCAATG CGACCGGACT CGACGTGCCT GCAGCCGGGC TCGCTTTTCC GCCCTGCGGG
GTCGACGATC TGCCGCATGT GCTGCGGCCG CGCGGAGATG GCGGGGTGCT GGAGCGATCC
GGCATGGTCG AAGTGGTGTC GTCGCTGGAG CGTGACGGCC GACCGGTGTT CCGCGATCTG
CGCTGGGGCG TCTATGTGGT GATCGAAGCG CCGAACGATT ACGCCGCCGA TTGCTTCAAG
CAATACGGGC TGAAGACGGA TTCGAGCGGC CGCTACGCCG CGATGTACAA GCCGTATCAC
CTGATCGGGC TCGAGCTCGG CATTTCGGTG CTGTCGGCGG CGCTGCGGCG CGAACCGACT
GGGCAGCCGC GCGACTTCCG CGGCGATGTC GTCGCGGTGG CGAAGCGGGA TCTGAAAGCC
GGCGAAATGC TCGACGGCGA AGGCGGCTAT ACGGTGTGGG GGAAGTTGAT GCGCGCATCC
GACAGCCTGA CGGCCGGCGC GCTGCCGATC GGGCTCGCGC ACCGGGTCAG ACTGACCAGC
GATGTCGGCC ACGGCGGTGT GGTGCGCTGG TGTGACGTCG AGATCGACAA GAGCGATCCG
ACCGTGGCGA CCAGGCGGGC GATGGAGCAG GCATTTTCGG GACGCTGA
 
Protein sequence
MNLHHLLAAR ADAGRPVRVA LIGAGKFGSM FLAQTPHTRG LEVAAIVDLD PERARDACRH 
VGWDEGRIAA TRFDSDPAAA NAAGIEVVVE ATGNPAAGIR HARAAIAAGQ HVVMVNVEAD
VLAGPLLADE ARRAGVVYSL AYGDQPALTA ELVDWARATG FRVVAAGKGT KYLPIYHDVT
PAGVWSHYGL SAAEAQSAGM NPQMFNSFLD GTKSAIEMAA IANATGLDVP AAGLAFPPCG
VDDLPHVLRP RGDGGVLERS GMVEVVSSLE RDGRPVFRDL RWGVYVVIEA PNDYAADCFK
QYGLKTDSSG RYAAMYKPYH LIGLELGISV LSAALRREPT GQPRDFRGDV VAVAKRDLKA
GEMLDGEGGY TVWGKLMRAS DSLTAGALPI GLAHRVRLTS DVGHGGVVRW CDVEIDKSDP
TVATRRAMEQ AFSGR