Gene RPD_3482 details

Gene Information

Locus tag: RPD_3482
Symbol:
ID: 4023996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3865277
End bp: 3867232
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 64%
IMG OID: 637963686
Product: alpha amylase, catalytic region
Protein accession: YP_570606
Protein GI: 91977947
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
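
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both are assumptions, although they agree with the values listed in this record. The function names are hypothetical and chosen only for illustration.

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = cds_length_bp(3865277, 3867232)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

With the listed coordinates this gives 1956 bp and 651 aa, matching the Gene Length and Protein Length fields.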


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACAGAT CGACCCAAGT GTTCCAGACC GTCGCGGCAG GCGGTGCTTT CCACATTGAA 
GATATCTTCC CGATCATCGA TAGCGGCCGT TTCCCCGTTA AAAGGGTCGT CGGCGAGCCG
ATCGAGGTCT GGGCGGATAT CTATCGCGAC GGCCATGAGG TGATCGCAGC GGCGCTGATC
TGGCGGCGCG AGCAGGATCA GGACTGGCAG CGCGCGCCGC TGCGGCATGT CGTCAACGAC
CGCTGGACGG CCACCTTCAC GCCCGACCAG ATTGGACGAT ATGTCTATGC GATCGAGGCC
TGGACCGACG AATTCGCGAC TTGGCGACAT GGGGTCGAGC TGAAGCTGAA GGCCGGCCAG
GACGTCACGC TCGACGCGCT GGAAGGCGCA GGCCTGCTGA CCAAGGCGCA GCACGGCGGA
GCGGAGGCGC TCGCGATCGT CCACCGCCAA TGCGACGAAT ATCTGCAGAC CGGCGAGGTC
GGCCCCCTGC TCGCGCCCGA GCTGCGCGAC GCGATGGGCG AAAGCCAGGC GCGGCTTGAT
GTCACGCGGT CGGCTCTGCT GCCGCTGATG ATCGACCGCG AGCGCGCCCG CAACGGCGCC
TGGTACGAGA TGGTGCCGCG CAGCCAGAGC ACCGTCCCCG GCCAGCACGG CACCTTCCGC
GATTGCATCG CCCGGCTGCC GGACATCGCC GCGATGGGCT TCGACGTGCT GTATTTCACG
CCGATCCATC CGATCGGGCG CGTCAACCGC AAGGGGCGCA ACAATTCGCT CAAGGCCGAG
GCCGGCGACC CCGGCAGCCC CTATGCGATC GGCGCGGAGG AAGGCGGCCA CGACGCGGTG
CATCCCGAGC TCGGCACGCT CGACGACTTC CACGCCTTGC TGAAGGCGTG CAAGCTGGTC
AATCTCGAGA TCGCGCTCGA CATCGCCGTG CAATGCTCGC CGGATCATCC CTGGCTGAAG
CAGCATCCGG ACTGGTTCAA GCGCCGGCCC GACGGCTCGA TGAAATACGC CGAGAACCCG
CCGAAGAAAT ACGAGGACAT CGTCAATCCG GACTTCACCT GCGAGGACGC CGGCTCGCTG
TGGAATGCGC TGCGCGACGT CATCCTGTTC TGGGTCGACC AGGGCGTGAA GATCTTCCGG
GTCGACAATC CGCACACCAA ACCGCTGCGG TTCTGGGAAT GGATGATCCG CGAGGTGCAG
CTCCGCCATC CCGACGTGCT GTTCCTGGCA GAAGCCTTCA CCCGGCCGAA GCTGATGAAG
GGCCTGGCCA AGCTCGGCTT CAGCCAGTCC TACACCTATT TCACCTGGCG GACCCAGAAA
TGGGAAATCG AGGAATACCT GCGCGAGCTG ACCGGATATC CGGAGCGTGA CTTCTATCGG
CCGAACTTCT TCGTCAACAC CCCGGACATC CTGCCTTTCC ATTTGCAGGG CGGCGAACCG
TGGATGTTCA GGTCGCGCGT CGCGCTCGCC GCAACGCTGT CATCGACCTA TGGCATCTAT
AGCGGCTTCG AACTGCTCGA ACACGAGCCG ATTCCCGGCA AGGAGGAGTA TCTCGATTCC
GAAAAATACG AGATCAGGGT GCGTGACTGG GACAAGCCCG GCAACATCAA GCCCTATATC
CGCGCGATCA ACAGCGCCCG CCGCGCCAAT CCGGCGCTTC AGCAGACCAG CAATCTGCGC
TTCGTCGACA TTCAAGACGC AAACGTGACC GGCTTCATCA AACAATCGCC CGATTTGAGC
AACGTCGTAG CTGTCGCCAT CGCCTTGTCG CGCGACTTCC ACGAATTCTG GTTTCCGCTC
GGCGATGTTC AGGTCGAGAT CGGCGGCGAG CGCCGTCCGG TCGCGGCCGT CGAGAACCTG
CTCACCGGCG AACGGCACGC CGTCGAATGG GGTGGACTTA ACCTGCGGAT CGACCCGCAA
CGCGATCCGG CGCTGCTTTT CCGCTGCCTG GCGTGA
 
Protein sequence
MNRSTQVFQT VAAGGAFHIE DIFPIIDSGR FPVKRVVGEP IEVWADIYRD GHEVIAAALI 
WRREQDQDWQ RAPLRHVVND RWTATFTPDQ IGRYVYAIEA WTDEFATWRH GVELKLKAGQ
DVTLDALEGA GLLTKAQHGG AEALAIVHRQ CDEYLQTGEV GPLLAPELRD AMGESQARLD
VTRSALLPLM IDRERARNGA WYEMVPRSQS TVPGQHGTFR DCIARLPDIA AMGFDVLYFT
PIHPIGRVNR KGRNNSLKAE AGDPGSPYAI GAEEGGHDAV HPELGTLDDF HALLKACKLV
NLEIALDIAV QCSPDHPWLK QHPDWFKRRP DGSMKYAENP PKKYEDIVNP DFTCEDAGSL
WNALRDVILF WVDQGVKIFR VDNPHTKPLR FWEWMIREVQ LRHPDVLFLA EAFTRPKLMK
GLAKLGFSQS YTYFTWRTQK WEIEEYLREL TGYPERDFYR PNFFVNTPDI LPFHLQGGEP
WMFRSRVALA ATLSSTYGIY SGFELLEHEP IPGKEEYLDS EKYEIRVRDW DKPGNIKPYI
RAINSARRAN PALQQTSNLR FVDIQDANVT GFIKQSPDLS NVVAVAIALS RDFHEFWFPL
GDVQVEIGGE RRPVAAVENL LTGERHAVEW GGLNLRIDPQ RDPALLFRCL A
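
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG can act as an initiation codon and is then read as methionine. The following minimal sketch illustrates that relationship and a GC-fraction calculation on short excerpts of the sequence above. It assumes the elongation codon assignments of table 11 match the standard code and treats only ATG, GTG and TTG as start codons, which is a simplification. All names in it are hypothetical.

```python
from itertools import product

# Codon table in TCAG order (standard elongation code, assumed here for table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate_cds("GTGAACAGAT CGACCCAAGT GTTCCAGACC"))
print(translate_cds("CGCTGCCTGGCGTGA"))
```

The first call reproduces the opening residues of the protein sequence (MNRSTQVFQT) and the second reproduces its last four residues (RCLA). Applying gc_fraction to the full gene sequence would allow a comparison with the listed GC content.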