Gene RPC_2055 details


Gene Information

Locus tag: RPC_2055
Symbol:
ID: 3973974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2248756
End bp: 2249886
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 63%
IMG OID: 637925164
Product: 2-hydroxyglutaryl-CoA dehydratase, D-component
Protein accession: YP_531929
Protein GI: 90423559
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB
TIGRFAM ID:
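
The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming exactly one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2248756, 2249886)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```

Both values agree with the Gene Length and Protein Length fields listed in the record.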


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.574959
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.736744
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGAC CGGATTCCAT CGCTGCCGCG GTGCAGGATG TCCTCGACCT GCACCGCGAT 
CGCCTCGCGC TCCTCGCCGG GATCGACCGG CCGAAGATCG GTTACCTGTC GATTCAGACC
CCCGAGGAAA TCCTGCTGGC CGCGGGCGCG ATCCCGTTCC GGCTGACCGG CGAGTACAGC
ACCGAGACCG ATAGGGCGAG CGTGCATCTT GGCAGCAACT ACTGTTCCTA CGTGCTAAGC
TGCTTCGGCG AGGGACTCGC CGGCGTCTAT GGCTTCGCGG ATGCGGTCGT GTTCGTCGAT
GCCTGCGATA TGCGCAAGCG GCTGTGGGAG ACCTGGGCTC GGACCGTTCC CGGTTGTGAA
AGCTGGTTCC TCGAACTTCC CAACGACGCG AGTCCATTGT CGAAATCGTT CTTCGCCGGT
CAGCTTCGCA AGCTGATCCG CCGGCTGGAG CAACGCTACG GCCGGCCGAT CGGCGAGGGC
GCGCTGCGCG ACGGCATCGC GCGGTGCAAC CGCACGCGCG AGCTGATGCA GCGGCTCTAC
GACGGCAACA AGCGCGGCGG CCCGCTGCTG ACGGGCCAGC AGTCGATCGA ACTCGTGAAG
GCGGCAACAA CGGGGCTGAA GGACGAATTC AACGATCGCC TGGCGGCGCT GCTCGACGCC
GCGGCTTCGC AGCAACCCCG GGCCGGTCGG CAGCGGCACC GGGTGTTGAT CTGCGGCAGC
TATTTCGACC AACCGGCGAT CGTGGAGGTC ATCGAGGCCA CCGGCGCCGA CATCGTCTGC
GGGGATATCA GCAACGGGGT CAAGTATTTC GAAGGCAGGA TCGATGCCGA CGCGGAGCCG
GTTGCCGCCA TCGCCGACTA CTATCTCGAA AAGCACACCA GCGCGCGCTG CATCGATACC
GACATCCGGC TGCGGCACCT GTTCGACTTG GTGCGGGACT ATCGCGTGGA ATCGGTGATC
TACTTCGCCC TGAAATTCTG TGACACCAAT CTGCACGACT ACCCCTACAT CAAGGAAAAA
TTGCGCGAGC AGAAGATCCC CGTTCTGTTC ATCGAAGGCG AGCACAACGG CAGCAATATC
GCCAGCATCA AGACGCGCAT CGAGACGTTC CTGGAGCCGC GGTTTTTCTG A
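
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or base composition can be checked. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names are hypothetical, and only the first line of the sequence is used to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full block to check the whole gene.
first_line = "ATGACGCGAC CGGATTCCAT CGCTGCCGCG GTGCAGGATG TCCTCGACCT GCACCGCGAT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # 0.65
```

Running the same functions on the complete sequence gives values that can be compared with the gene length and GC content reported in the record.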
 
Protein sequence
MTRPDSIAAA VQDVLDLHRD RLALLAGIDR PKIGYLSIQT PEEILLAAGA IPFRLTGEYS 
TETDRASVHL GSNYCSYVLS CFGEGLAGVY GFADAVVFVD ACDMRKRLWE TWARTVPGCE
SWFLELPNDA SPLSKSFFAG QLRKLIRRLE QRYGRPIGEG ALRDGIARCN RTRELMQRLY
DGNKRGGPLL TGQQSIELVK AATTGLKDEF NDRLAALLDA AASQQPRAGR QRHRVLICGS
YFDQPAIVEV IEATGADIVC GDISNGVKYF EGRIDADAEP VAAIADYYLE KHTSARCIDT
DIRLRHLFDL VRDYRVESVI YFALKFCDTN LHDYPYIKEK LREQKIPVLF IEGEHNGSNI
ASIKTRIETF LEPRFF
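
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand, on the assumption that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. The `translate` function is illustrative, reads in frame from the first base, and stops at the first stop codon.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; both start on a codon boundary.
print(translate("ATGACGCGAC CGGATTCCAT CGCTGCCGCG GTGCAGGATG TCCTCGACCT GCACCGCGAT"))
# MTRPDSIAAAVQDVLDLHRD
print(translate("GCCAGCATCA AGACGCGCAT CGAGACGTTC CTGGAGCCGC GGTTTTTCTG A"))
# ASIKTRIETFLEPRFF
```

The two outputs match the first twenty and the last sixteen residues of the protein sequence above, and the final TGA codon is the stop codon that is not represented in the protein.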