Gene Rpal_4548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4548 
Symbol 
ID6412232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4899887 
End bp4901197 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content67% 
IMG OID642714428 
Productnucleotide sugar dehydrogenase 
Protein accessionYP_001993517 
Protein GI192292912 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATCA CGATGATTGG GACGGGTTAT GTGGGGCTGG TGTCCGGGGC ATGCTTCGCG 
GACTTCGGCC ACCAGGTAAC CTGCGTCGAC AAGGATGCCG GCAAAATCGC GGCCCTGCAT
CGGGGCGAAA TTCCCATTTA CGAACCGGGC CTTGACGAGC TGGTCGCGGC CAACGTCAAG
GCTGGGCGGC TCGACTTTAC CACCGACCTG ACCGCCCCGG TGGGCGAAGC GGACGCAGTG
TTCATCGCCG TCGGGACCCC GTCACGGCGC GGCGACGGCC ACGCTGACCT ATCCTATGTG
TATGCGGCCG CAAAGGAAAT CGCCGCCGCC CTGAAAGGCT TCACGGTCGT GGTGACCAAG
TCGACGGTCC CGGTCGGCAC CGGCGACGAG GTCGAGCGGC TGATCCGCGA GACCAATCCC
ACCGCCGACG CGGCGGTCGC CTCGAACCCT GAATTCCTGC GCGAGGGCGC CGCGATCCGC
GACTTCAAGT TCCCCGACCG CATCGTGATC GGGACTGCCG ACGAGCGCGC CCGCAAAGTG
ATGGGCGAGA TCTACCGCCC GCTGTCGCTG AACCAGGGCC CGCTGATGTA CACCGCGCGG
CGCACCGCCG AGCTGATCAA ATACGCCGCT AACGCATTCC TGGCGACCAA GATTACCTTC
ATCAACGAGA TGGCGGACCT CGCCGAAAAG GTCGGCGCCG ACGTCCAGGA CGTCGCCCGC
GGCATCGGCA TGGACAACCG GATCGGCTCC AAATTCCTGC ATGCCGGCCC CGGCTTCGGC
GGCTCGTGCT TCCCCAAGGA CACCCGCGCG CTGGTGCAGA CCGCCCATGA CCACGACGTA
CCGGTGCGGA TCGTCGAGGC GGTCCTTGCC GTCAACGACA ACCGCAAGCG CGCAATGGCC
CGCAAGGTCT CGCACGCGCT CGGCGGCAAC ATGCGCGGCA AGACCATCGC GGTGCTCGGC
CTGACCTTCA AGCCGGACAC CGACGACATG CGCGAGGCGC CGTCGATCCC GCTCGTCACC
GGCCTCACCG ACATGGGCGC CAAGGTGAAG GCGTTCGATC CCGCCGGCAT GGCGCAGGCC
AAGGCGGAGT TGCCGGACAT CACCTACTGC GAGGACGCCT ACGACTGCGC CAAGGGCGCC
GACGCGCTAG TGATCGTCAC CGAATGGGTG CAATTCCGCG CGCTCGACCT GCCGCGGCTG
AAAGCCGCAA TGGCGCAGCC GATCGTCGTC GACCTGCGCA ACATCTACCG CCCCACCGAA
ATGGCCGAGC ACGGCTTCAG TTATCACAGC GTCGGCCGCG GCGACGCGTA G
 
Protein sequence
MRITMIGTGY VGLVSGACFA DFGHQVTCVD KDAGKIAALH RGEIPIYEPG LDELVAANVK 
AGRLDFTTDL TAPVGEADAV FIAVGTPSRR GDGHADLSYV YAAAKEIAAA LKGFTVVVTK
STVPVGTGDE VERLIRETNP TADAAVASNP EFLREGAAIR DFKFPDRIVI GTADERARKV
MGEIYRPLSL NQGPLMYTAR RTAELIKYAA NAFLATKITF INEMADLAEK VGADVQDVAR
GIGMDNRIGS KFLHAGPGFG GSCFPKDTRA LVQTAHDHDV PVRIVEAVLA VNDNRKRAMA
RKVSHALGGN MRGKTIAVLG LTFKPDTDDM REAPSIPLVT GLTDMGAKVK AFDPAGMAQA
KAELPDITYC EDAYDCAKGA DALVIVTEWV QFRALDLPRL KAAMAQPIVV DLRNIYRPTE
MAEHGFSYHS VGRGDA