Gene RPB_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2100 
Symbol 
ID3908514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2387545 
End bp2388531 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content68% 
IMG OID637883993 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_485717 
Protein GI86749221 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.415706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000106106 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
TTGCACATCC TCATCCTCGG CGCCGCCGGC ATGGTCGGGC GCAAACTCAC GGAGCGGCTG 
CTCGCCGACG GTCGCCTCGG TGATCGCGAG ATCACCCGGA TGACGCTGCA GGACGTGGTC
GCGCCGGCCA CGCCCGCCAA GGCGGCGATG CCGATCACGA CGATCGTCAG CGATCTCGCC
GAGCCGGGGC AGGCGGCGGC GCTGGTGGCG CATCGGCCGG AGGTGATCTT CCATCTCGCC
GCGATCGTCT CCGGCGAGGC CGAGGCCGAT TTCGACAAGG GCTACCGCAT CAATCTCGAC
GGCACGAGGC ATCTGATCGA CGCGATCCGC GCGGAAGGCG ACGACTATCA TCCGCGGCTG
GTGTTCACCT CGTCGATCGC GGTGTTCGGC GCGCCGTTCC CCGAGAAAAT CGGCGACGAA
TTCCTCTCCG CGCCGCTCAC CAGCTACGGC ACCCAGAAGG CGATCTGCGA ACTCTTGATC
GCCGACTATA CCCGCAAGGG CTTTCTCGAC GGCGTCGGCA TTCGCCTGCC GACGATCTGC
GTCCGCCCCG GCACGCCCAA CAAGGCGGCC TCCGGCTTCT TCTCCAACAT CATCCGCGAG
CCGCTCGCCG GCCACGAGGC GGTGCTGCCG GTCTCCGACG ACGTGATGCA CTGGCACGCC
TCGCCGCGCT CCGCGGTCAG CTTCCTGATC CATGCCGGCA CGATGGACAC GCAGGCGATC
GGCCCGCGCC GCAATTTGTC GATGCCCGGT CTCGCCGCCA CCGTCGGCGA ACAGATCGCG
GCGCTCGAAC GCGTCGCCGG CAAGGGCGTC GTGGCGCGGA TCAGGCGCGA GCCCGATCCG
GTGATCATGG GCATCGTCGC CGGCTGGCCG CGCAATTTTG CGACCGACCG CGCGCTCGCG
CTCGGCTTCA CCACCGCGGA ACAGAGCTTC GACGACATCA TCCGGATTCA CATCGAGGAC
GAACTGGGCG GGAATTTTGC CGCCTGA
 
Protein sequence
MHILILGAAG MVGRKLTERL LADGRLGDRE ITRMTLQDVV APATPAKAAM PITTIVSDLA 
EPGQAAALVA HRPEVIFHLA AIVSGEAEAD FDKGYRINLD GTRHLIDAIR AEGDDYHPRL
VFTSSIAVFG APFPEKIGDE FLSAPLTSYG TQKAICELLI ADYTRKGFLD GVGIRLPTIC
VRPGTPNKAA SGFFSNIIRE PLAGHEAVLP VSDDVMHWHA SPRSAVSFLI HAGTMDTQAI
GPRRNLSMPG LAATVGEQIA ALERVAGKGV VARIRREPDP VIMGIVAGWP RNFATDRALA
LGFTTAEQSF DDIIRIHIED ELGGNFAA