Gene RPC_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3643 
Symbol 
ID3972014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4051718 
End bp4052704 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content67% 
IMG OID637926752 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_533497 
Protein GI90425127 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.611335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.301343 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACATCC TCATTCTCGG CGCCGCCGGC ATGGTCGGGC GCAAACTGAC CGAGCGGCTA 
TTGCGCGACG GTCATCTCGG CGACCGCGCC ATCACCAGGT TGACGCTGCA GGACGTGGTG
GCGGCGCCGA AGCCGCTCGA TGCGACGATT CCGGTCACCA TCGTCACCTC GGATTTCGCC
GATCCGCTGG GGGCCGCGCC GCTGGTAGCG TGCTGTCCGG AGGTGATCTT CCATCTCGCG
GCGATCGTGT CCGGCGAGGC CGAGGTCGAA TTCGACAAGG GCTACCGCAT CAATCTCGAC
GGCACCCGCT ATCTGCTTGA GGCAATCCGG GCGATCGGCG ACGGCTACCG GCCGCGGCTG
GTGTTCACCT CGTCGATCGC GGTGTTCGGC GCGCCGTTCC TCGACAAGAT CGGCGACGAG
TTCTTTCACA CCCCGCTGAC CAGCTACGGC ACCCAGAAAT CGATCTGCGA ATTGCTGCTG
GCGGATTACA GCCGCAAGGG CTTTGTCGAC GGCATCGGCA TCCGGCTGCC GACGATCTGC
GTCCGCCCGG GCAAGCCGAA CAAGGCGGCG TCGGGTTTCT TCTCCAATAT CATCCGCGAG
CCGCTGGCGG GCCACGAGGC GGTGCTGCCG GTGTCGGATG ACGTGCGGCA CTGGCACGCC
TCGCCGCGCT CCGCGGTGGG CTTCCTGCTG CACGCCGCGA CCATGGATCT GAAGGCGATG
GGGCCGCGGC GCAATCTGTC GATGCCCGGG CTTTCGGTGA CGGTCGGGGA ACAGATTGCA
GCCCTCGCGC GGGTGGCGGG GCAGGGCGTC GTCGCGCGGA TCAGGCGCGA GCCGGATCCG
GCGATCATCG GCATCGTCGC CGGCTGGCCG CGCGACTTTT CCACCGACCG CGCGCAAAGC
CTCGGCTTCA GCACCGCGGA AAACACCTTC GACGACATCA TCCGGATTCA CATCGAGGAT
GAACTCGAAG GCGAGTTCGT GCGGTAG
 
Protein sequence
MHILILGAAG MVGRKLTERL LRDGHLGDRA ITRLTLQDVV AAPKPLDATI PVTIVTSDFA 
DPLGAAPLVA CCPEVIFHLA AIVSGEAEVE FDKGYRINLD GTRYLLEAIR AIGDGYRPRL
VFTSSIAVFG APFLDKIGDE FFHTPLTSYG TQKSICELLL ADYSRKGFVD GIGIRLPTIC
VRPGKPNKAA SGFFSNIIRE PLAGHEAVLP VSDDVRHWHA SPRSAVGFLL HAATMDLKAM
GPRRNLSMPG LSVTVGEQIA ALARVAGQGV VARIRREPDP AIIGIVAGWP RDFSTDRAQS
LGFSTAENTF DDIIRIHIED ELEGEFVR