Gene RPD_3804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3804 
Symbol 
ID4024320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4244391 
End bp4246250 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content65% 
IMG OID637964008 
Productdihydroxy-acid dehydratase 
Protein accessionYP_570926 
Protein GI91978267 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTCT ATCGATCCCG AACGACCACT CACGGCCGCA ACATGGCCGG CGCGCGCGGC 
CTGTGGCGCG CCACCGGCAT GAAGGATTCC GATTTCGGCA AGCCGATCAT CGCCGTCGTC
AACTCGTTCA CGCAGTTCGT GCCGGGCCAC GTTCATCTGA AGGACCTCGG GCAGCTCGTT
GCGCGCGAGA TCGAGGCCGC CGGCGGTGTC GCCAAGGAGT TCAACACCAT CGCGGTCGAC
GATGGCATCG CGATGGGGCA CGGCGGAATG CTGTACAGCC TGCCGTCGCG CGAACTGATC
GCCGACAGCG TCGAATACAT GGTCAACGCC CACTGCGCCG ACGCCATGGT TTGCATTTCG
AACTGTGACA AGATCACCCC CGGCATGCTG ATGGCCGCGA TGCGGCTGAA CATCCCCGCG
GTGTTCGTCT CCGGCGGCCC GATGGAAGCC GGCAAGGTGG TGCTGAATGG CAAGACACAC
GCCGTCGACC TGATCGACGC CATGGTCGCG GCCGCCGACA GCAATATGAG CGATGCCGAT
GTGCAGGTGA TGGAGCGCTC GGCGTGCCCG ACCTGCGGCT CGTGTTCGGG CATGTTCACC
GCCAATTCGA TGAACTGCCT CGCCGAGGCG CTGGGTCTCG CGCTGCCCGG CAATGGCTCG
GTGCTCGCCA CCCATGCCGA TCGCAAGCGG CTGTTCGTCG AGGCCGGTCA CACCATCGTC
GATCTGGCGC GGCGTTACTA CGAAGGTGAC GACGAATCCG TGCTGCCGCG CAACGTCGCC
AGCTTCAAAG CGTTCGAGAA CGCGATGACG CTCGACATCG CGATGGGTGG CTCGACCAAT
ACGGTGCTGC ATCTGCTCGC CGCGGCGCGC GAGGCCGAAC TCGACTTCTC GATGAAGGAC
ATCGACCGGC TGTCGCGCAA GGTGCCGTGC CTGAGCAAGA TCGCCCCGTC GGTGTCCGAC
GTTCACATGG AGGACGTGCA TCGCGCCGGC GGCATCATGG CGATCCTCGG CGAGCTCGAT
CGCGCCGGGC TGATCCACAA TTCCTGCCCG ACGGTGCATT CCGAGACGCT CGGTGCCGCA
CTGGCGCGTT GGGACATCCG CCAGAGCAAC AGCGAAGCGG TCCGCACCTT CTACCGCGCC
GCGCCGGGCG GCGTGCCGAC CCAGGTCGCG TTCAGCCAGG ACCGCCGCTA CGACGAGCTC
GACCTCGACC GGCAGAAGGG CGTGATCCGC GACGCGGAGC ATGCCTTCAG CAAGGACGGC
GGTCTCGCCG TGCTGTACGG CAACATCGCC GAAGACGGCT GCATCGTGAA GACCGCGGGC
GTCGACGCCT CGATCCTGAC CTTCTCCGGT CCGGCGAAAG TGTTCGAGAG TCAGGACGAT
GCGGTGTCGG CGATCCTCGG CAACAAGATT GTCGCCGGCG ACGTCATCGT CATCCGCTAC
GAAGGACCGC GCGGCGGACC GGGCATGCAG GAGATGCTGT ATCCGACCAG CTATCTGAAG
TCGAAAGGCC TCGGCAAGGC ATGCGCCTTG ATCACCGATG GCCGTTTTTC AGGCGGCACC
TCGGGGCTTT CGATCGGTCA CGTTTCGCCG GAAGCGGCCG AAGGCGGACT GATCGGTCTG
GTCCGCGATG GCGATCGCGT CGCGATCGAC ATCCCCAACC GCAGCATCAA CCTCGACGTT
TCCGCCGACG AATTGGCGCG ACGCAGCGAA GAGGAGCAGG CGCGCGGCGA CAAGGCTTGG
CAGCCGAAGG ACCGCAACCG CGTGGTCTCT GCTGCACTGC AGGCCTACGC TGCGCTGACC
ACAAGCGCGG CGAACGGCGC AGTACGCGAC GTCAACCGCC GGCTGGGAAA AGGCAAGTAA
 
Protein sequence
MPVYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV 
AREIEAAGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MAAMRLNIPA VFVSGGPMEA GKVVLNGKTH AVDLIDAMVA AADSNMSDAD
VQVMERSACP TCGSCSGMFT ANSMNCLAEA LGLALPGNGS VLATHADRKR LFVEAGHTIV
DLARRYYEGD DESVLPRNVA SFKAFENAMT LDIAMGGSTN TVLHLLAAAR EAELDFSMKD
IDRLSRKVPC LSKIAPSVSD VHMEDVHRAG GIMAILGELD RAGLIHNSCP TVHSETLGAA
LARWDIRQSN SEAVRTFYRA APGGVPTQVA FSQDRRYDEL DLDRQKGVIR DAEHAFSKDG
GLAVLYGNIA EDGCIVKTAG VDASILTFSG PAKVFESQDD AVSAILGNKI VAGDVIVIRY
EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHVSP EAAEGGLIGL
VRDGDRVAID IPNRSINLDV SADELARRSE EEQARGDKAW QPKDRNRVVS AALQAYAALT
TSAANGAVRD VNRRLGKGK