Gene RPB_1884 details

Gene Information

Locus tag: RPB_1884
Symbol:
ID: 3908079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2154499
End bp: 2156553
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 62%
IMG OID: 637883778
Product: alpha amylase, catalytic region
Protein accession: YP_485503
Protein GI: 86749007
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
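
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2154499
END_BP = 2156553
GENE_LENGTH_BP = 2055
PROTEIN_LENGTH_AA = 684


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both checks pass for this record: 2055 bp and 684 aa.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the reported gene length and protein length are consistent with the coordinates.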


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.360227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTTCCG GGAACTTTGC CGTTCCTGAG GCTTTGTTAC GAGACGACCG GGCGTTGTTC 
GACTTCCAGG GAAATAGGGC TACGGTCCTA TCGCATCACG TGAACAGATC GACTCAAGCA
TTCCAGACCG TCTCGGCTGG CGGCGCTTTC CAAATTGAAG ATATCTTCCC GATCATCGAC
AGCGGCCGAT TTCCCGTCAA GCGGGTTGTC GGCGAGCCCA TCGAGATCTG GGCGGACGTG
TATCGCGACG GTCACGAGGT GATCGCTGCG GCGCTGGTCT GGCGGCGCGA GCAGGATCAG
GATTGGCAGC GCGTCCCGAT GCGTCACGTC GTCAACGATC GCTGGACCGC CTCCTTCGTG
CCCGATCATA TTGGACGATA TGTCTATGCG ATCGAAGCGT GGACCGACGA ATTCGCGACC
TGGCGGCACG GCATCGAGCT GAAGCAGAAG GCGGGGCAAG ACGTCGCACT CGACGCGCTG
GAAGGCGCCG GGATGCTGAC CAAAGCTCTG AACGGCGGGC ACGAAGCGCT GACCGTTATT
CAGCGGCAAT GCGAGGACTA TCTGCAGACC GGCGACGTGG CGCCTTTGCT CACCCCCGAA
TTGCGCGACG CGATGGCCGA AAGCCAGGCG CGGCCGGACG TCACCCGCTC CGCGCTGCTG
CCGCTGATGA TCGACCGCGA ACGCGCCCGC TGCGGAGCGT GGTACGAGAT GGTGCCGCGC
AGCCAGAGCA CGGTGCCCGG CCAGCACGGC ACCTTCCGCG ACTGCATCGC AAGGTTGCCG
GACATCGCGG CGATGGGCTT CGATGTGCTG TATTTCACGC CGATCCACCC GATCGGCCGC
GTCAATCGCA AAGGCCGCAA CAACTCGCTG AAGGCCGACC CCGGCGATCC GGGCAGCCCC
TATGCGATCG GTGCTGAGGA AGGCGGCCAC GACGCGGTGC ATCCCGAGCT CGGCACGCTC
GACGATTTCC ACGCGCTGGT CGAAGCCTGC AAGCTGGTGA ACGTCGAGAT CGCGCTCGAC
ATCGCGGTGC AATGCTCGCC GGATCACCCG TGGCTGAAGC AGCATCCGGA CTGGTTCAAG
CGCCGTCCCG ACGGCTCGAT GAAATACGCC GAGAACCCGC CGAAGAAATA CGAGGACATC
GTCAATCCGG ATTTCACCTG CGAGGACGCC GGCTCGCTGT GGAATGCGCT GCGCGACGTG
ATCCTGTTCT GGGTCGACCA CGGCGTGAAG ATCTTTCGGG TGGACAACCC GCACACCAAG
CCGCTGCGGT TCTGGGAATG GCTGATCCAC GAGGTTCAGC TCGCTCACCC CGACGTGCTG
TTTCTCGCCG AGGCCTTCAC CCGGCCGAAG CTGATGAAGG GGCTCGCCAA ACTCGGCTTC
AGCCAGTCCT ACACCTACTT CACCTGGCGG ACGCAGAAGT GGGAGATCGA GGAATACTTG
CGCGAACTGA CAGGCTATCC GGAGCGTGAT TTCTACCGCC CGAATTTCTT CGTCAATACC
CCGGACATCC TCCCCTACCA TCTGCAGGGC GGCGAGCCGT GGATGTTCAA ATCCCGCCTT
GCGCTCGCCG CGATGCTGTC ATCGACCTAT GGCATCTATA ACGGCTTCGA GCTGATCGAA
CACGCGCCGA TCCCCGGCAA GGAGGAGTAT CTCGATTCCG AAAAATACGA GATCAAGGTC
CGCGACTGGG ACAGACCGGG CAATATCAAG CCTTATATCC GCGAGATCAA TCAGGCCCGC
CGCAACAATC CGGCGCTTCA GCAGACGAGC AATCTTCGCT TCGTGGATAT CCAGGATCCC
AACGTGACCG GCTTCGTCAA ATACTCAACG GACCTGACCA ACGTCGTGGC CGTCGCGATC
GCGCTGTCGC GCGATTTCCA TGAATTCTGG TTTCCGCTCG GCGACACCCA GGTCGAGGTC
GGCGGCGAGC GCCGCCCGGT CGTGGCGGTC GAGAATCTGC TCACCGGCGA GCGGCACGCC
GTGGAATGGG GCGGCCTCAC GCTACGGATC GATCCGCAAC GCGATCCTGC GCTGCTTTTC
CGCTGCCTGG CGTGA
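
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in 10-base groups like the one above. This is a small sketch, not the method the database itself uses; `gc_content` is a hypothetical helper that strips the grouping whitespace before counting.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, pasted with its original spacing.
first_row = "GTGAGTTCCG GGAACTTTGC CGTTCCTGAG GCTTTGTTAC GAGACGACCG GGCGTTGTTC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence, line breaks included, gives the value for the whole CDS.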
 
Protein sequence
MSSGNFAVPE ALLRDDRALF DFQGNRATVL SHHVNRSTQA FQTVSAGGAF QIEDIFPIID 
SGRFPVKRVV GEPIEIWADV YRDGHEVIAA ALVWRREQDQ DWQRVPMRHV VNDRWTASFV
PDHIGRYVYA IEAWTDEFAT WRHGIELKQK AGQDVALDAL EGAGMLTKAL NGGHEALTVI
QRQCEDYLQT GDVAPLLTPE LRDAMAESQA RPDVTRSALL PLMIDRERAR CGAWYEMVPR
SQSTVPGQHG TFRDCIARLP DIAAMGFDVL YFTPIHPIGR VNRKGRNNSL KADPGDPGSP
YAIGAEEGGH DAVHPELGTL DDFHALVEAC KLVNVEIALD IAVQCSPDHP WLKQHPDWFK
RRPDGSMKYA ENPPKKYEDI VNPDFTCEDA GSLWNALRDV ILFWVDHGVK IFRVDNPHTK
PLRFWEWLIH EVQLAHPDVL FLAEAFTRPK LMKGLAKLGF SQSYTYFTWR TQKWEIEEYL
RELTGYPERD FYRPNFFVNT PDILPYHLQG GEPWMFKSRL ALAAMLSSTY GIYNGFELIE
HAPIPGKEEY LDSEKYEIKV RDWDRPGNIK PYIREINQAR RNNPALQQTS NLRFVDIQDP
NVTGFVKYST DLTNVVAVAI ALSRDFHEFW FPLGDTQVEV GGERRPVVAV ENLLTGERHA
VEWGGLTLRI DPQRDPALLF RCLA
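
The protein sequence begins with M although the gene sequence begins with GTG, which would otherwise code for valine. Under translation table 11 an alternative start codon is read as methionine. The sketch below illustrates this, assuming the standard codon assignments for elongation and treating ATG, GTG, and TTG as start codons (a subset of the starts NCBI lists for table 11); `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons accepted under table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("GTGAGTTCCG GGAACTTTGC CGTTCCTGAG"))  # MSSGNFAVPE
```

The same function applied to the final row of the gene sequence (`CGCTGCCTGG CGTGA`) yields `RCLA`, matching the end of the protein sequence, with TGA read as the stop codon.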