Gene RPB_4066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4066 
Symbol 
ID3911873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4638581 
End bp4640440 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content67% 
IMG OID637885970 
Productdihydroxy-acid dehydratase 
Protein accessionYP_487670 
Protein GI86751174 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.405184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCAT ATCGATCCCG AACGACCACT CACGGCCGCA ACATGGCCGG CGCGCGCGGC 
CTCTGGCGCG CCACCGGGAT GAAGGATTCC GACTTCGGCA AGCCGATCAT CGCCGTCGTC
AACTCCTTCA CGCAGTTCGT GCCGGGGCAC GTGCATCTGA AGGATCTCGG CCAGCTCGTC
GCCCGGGAGA TCGAGGCCGC CGGCGGCGTC GCCAAGGAAT TCAACACCAT CGCGGTCGAC
GACGGCATCG CGATGGGCCA TGACGGCATG CTGTACAGCC TGCCGTCGCG CGAACTGATC
GCCGACAGCG TCGAATACAT GGTCAACGCG CACTGCGCCG ACGCGATGGT CTGCATCTCG
AACTGCGACA AGATCACCCC CGGCATGCTG ATGGCCGCGA TGCGGCTCAA CATCCCAGCG
GTGTTCGTCT CCGGCGGGCC GATGGAAGCG GGCAAAGTGG TGTTGAAGGG CAAGACCCAC
GCCGTCGACC TGATCGACGC GATGGTCGCG GCCGCCGACA GCAGCATGAG CGACGAAGAC
GTGCAGACGA TGGAGCGCTC GGCGTGCCCG ACCTGCGGCT CCTGCTCCGG CATGTTCACC
GCCAATTCGA TGAACTGTCT CGCCGAGGCG CTGGGTCTGG CGCTGCCCGG CAACGGCTCG
GTGCTCGCCA CCCATGCCGA TCGCAAGCGG CTGTTCGTCG AGGCCGGTCA CACCATCGTC
GATCTGGCGC GGCGCTACTA CGAAGGCGAC GACGAATCCG TGCTGCCGCG CAAGGTCGCG
AGCTTCGAGG CGTTCGAGAA CGCGATGACG CTCGACATCG CGATGGGCGG CTCGACCAAC
ACGGTGCTGC ATCTGCTCGC CGCGGCGCGC GAGGCCGAAC TCGACTTCTC GATGAAGGAC
ATCGACCGGC TGTCGCGCAA GGTGCCGTGC CTGAGCAAGA TCGCCCCGTC GGTGTCCGAC
GTCCACATGG AGGACGTGCA TCGCGCCGGC GGCATCATGG CGATCCTCGG CGAGCTCGAT
CGCGCCGGAC TGATCCACAA CTCATGCCCG ACTGTGCATT CGGAGACGCT CGGTGCCGCG
CTGGCGCGCT GGGACATCCG CCAGAGCAAC AGCGAAGCGG TCCGCACCTT CTACCGCGCC
GCGCCGGGCG GCGTGCCGAC CCAGGTTGCG TTCAGCCAGG ACCGCCGCTA CGACGAGCTC
GACCTCGACC GGCAGAAGGG CGTGATCCGC GACGCCGAGC ACGCCTTCAG CAAGGACGGC
GGCCTCGCGG TGCTGTACGG CAATATTGCG CTCGACGGCT GCATCGTGAA GACCGCCGGC
GTCGACGCCT CGATCCTGAC CTTCTCCGGC CCGGCGAAAG TGTTCGAGAG CCAGGACGAC
GCGGTGTCGG CGATCCTCGG CAACAAGATC GTCGCTGGCG ACGTCATCGT GATCCGCTAC
GAAGGGCCAC GTGGCGGGCC GGGCATGCAG GAGATGCTGT ATCCGACCAG CTATCTGAAG
TCGAAGGGCC TCGGCAAAGC CTGCGCGCTG ATCACCGACG GCCGCTTCTC CGGCGGCACC
TCGGGCCTGT CGATCGGTCA CGTCTCGCCC GAGGCTGCGG AAGGCGGCCT GATCGGACTG
GTGCGGAACG GCGACCGGAT TTCGATCGAC ATTCCCAATC GCGGCATCAC CCTTGACGTC
GCCGCTGACG AGCTGTCGCG GCGCGCCGAG GAGGAAGAGG CGAAGGGCGA CAAGGCCTGG
CAGCCGAAAG ACCGCAAGCG CAAGGTCTCG GCCGCGCTGC AGGCCTATGC CATGCTGACC
ACCAGCGCTG CGAACGGCGC GGTGCGCGAC GTCAACCGCA GGCTCGGCAA AGGAAAGTAG
 
Protein sequence
MPAYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV 
AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MAAMRLNIPA VFVSGGPMEA GKVVLKGKTH AVDLIDAMVA AADSSMSDED
VQTMERSACP TCGSCSGMFT ANSMNCLAEA LGLALPGNGS VLATHADRKR LFVEAGHTIV
DLARRYYEGD DESVLPRKVA SFEAFENAMT LDIAMGGSTN TVLHLLAAAR EAELDFSMKD
IDRLSRKVPC LSKIAPSVSD VHMEDVHRAG GIMAILGELD RAGLIHNSCP TVHSETLGAA
LARWDIRQSN SEAVRTFYRA APGGVPTQVA FSQDRRYDEL DLDRQKGVIR DAEHAFSKDG
GLAVLYGNIA LDGCIVKTAG VDASILTFSG PAKVFESQDD AVSAILGNKI VAGDVIVIRY
EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHVSP EAAEGGLIGL
VRNGDRISID IPNRGITLDV AADELSRRAE EEEAKGDKAW QPKDRKRKVS AALQAYAMLT
TSAANGAVRD VNRRLGKGK