Gene Rpal_4812 details


Gene Information

Locus tag: Rpal_4812
Symbol:
ID: 6412498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5179509
End bp: 5180672
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 67%
IMG OID: 642714690
Product: glucose sorbosone dehydrogenase
Protein accession: YP_001993777
Protein GI: 192293172
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
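
The coordinate and length fields above are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(5179509, 5180672)
print(length, protein_length(length))  # prints: 1164 387
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.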


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGCTC CGATCGTCTG GGTTTCCGGC ACGCTGAGCG CAGCCACCGC GCTGCTTGCC 
TCGCTGCTGA TCGTGACCGC CAGCCGCGGT GAGATCACCA CTTACGAATC CTCCGCAGGC
CCGCTGACCG TCCAGACCGT GGCGCAGAAG CTGGTACACC CCTGGGGTCT GGCGTTTCTG
CCTGATGGCC GGATGCTGGT GACTGAGCGC CCCGGCCGTC TGCGGCTGGT GACACCGCAG
GGCCAGGTCT CGAAGCCTCT GCAGGGCGTG CCGGAGGTGT GGGCCTCGGG CCAGGGCGGA
CTGCTCGACG TCGCGGCCGA CAAGGACATC GCCAGCAACC ACACGATCTA CCTGTGCTAC
GCCGAGCGCG ACGGCAATGG CGGCCGGACC GCGGTGGCAC GTGCGTCTCT CGACACCGGC
GATGCACCGC GGCTGAACGA CATCAAGGTG ATCTTCCGCC AGCAGGGGCC GCTGTCGTCC
GGCAATCACT ATGGCTGCCG GATCGCGCAG GACGGCAGCG GCAATCTGTT CGTGACGCTC
GGCGAGCACT ACGCGTATCG CGATCAGGCG CAGAGCCTGT CCAATCATCT GGGCAAGATC
GTCCGCATTG CGCCGGACGG CAGCGTGCCC GACGGCAATC CGTTCGCCGG CCGCGAGGGC
GCCGAGCCCG AACTCTGGAG CCTCGGCCAC CGCAATCCGC AGGGCCTCGC CTTCAACCCC
GCCGACGGCA AACTGTGGGA GGTCGAGCAC GGCCCGCGCG GCGGCGATGA GGTCAACATC
ATCCGCAAGG GTGAGAATTA CGGCTGGCCG GTGATCGGCT ACGGCATCGA CTATAACGGC
GCCAAGATCC ACGAGGCGAC CGCTAAGCCG GGCATGCAGC AGCCCGCCAA ATATTGGGTG
CCGTCGATCT CGCCGAGCGG GATGGCGTTC TACACCGGCA AGCTGTTTCC GACCTGGACC
GGCAGCCTGT TCGTCGGCGC GCTGTCGGGA CAGATGCTGG TGCGGCTGTC GCTCGACGGC
GACAAGATCA CCGGCGAAGA GCGGCTGTTG CAGACGCTGG ACGAACGCAT CCGCGACGTG
CGTCAGGGGC CGGACGGTGC GCTGTGGCTC TTGACCGACA GCGACACCGG ACGCCTTCTG
CGCGTCGTGC CAGCGGCCAA CTAA
 
Protein sequence
MKAPIVWVSG TLSAATALLA SLLIVTASRG EITTYESSAG PLTVQTVAQK LVHPWGLAFL 
PDGRMLVTER PGRLRLVTPQ GQVSKPLQGV PEVWASGQGG LLDVAADKDI ASNHTIYLCY
AERDGNGGRT AVARASLDTG DAPRLNDIKV IFRQQGPLSS GNHYGCRIAQ DGSGNLFVTL
GEHYAYRDQA QSLSNHLGKI VRIAPDGSVP DGNPFAGREG AEPELWSLGH RNPQGLAFNP
ADGKLWEVEH GPRGGDEVNI IRKGENYGWP VIGYGIDYNG AKIHEATAKP GMQQPAKYWV
PSISPSGMAF YTGKLFPTWT GSLFVGALSG QMLVRLSLDG DKITGEERLL QTLDERIRDV
RQGPDGALWL LTDSDTGRLL RVVPAAN
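
The sequences above are printed in blocks of ten characters separated by spaces. A minimal Python sketch, assuming that layout, shows one way to strip the whitespace before measuring length or GC content. `clean_sequence` and `gc_percent` are hypothetical helper names, and the input is a short toy string built from the first two blocks and the last block of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join space- and newline-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input only: two leading blocks and the trailing block of the listing.
toy = "ATGAAGGCTC CGATCGTCTG\nCTAA"
seq = clean_sequence(toy)
print(len(seq), seq[-3:])  # prints: 24 TAA
print(gc_percent(seq))     # prints: 50.0
```

Applying the same two helpers to the full gene sequence would give a value to compare against the GC content field.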