Gene Rru_A3302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3302 
Symbol 
ID3836749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3800703 
End bp3802073 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content69% 
IMG OID637827418 
Productglucan 1,4-alpha-glucosidase 
Protein accessionYP_428384 
Protein GI83594632 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.219547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGG CGCCTTCGGC CACCGGCCTT GAAGACTGGC TGACCACGCA AATCCCGCGC 
TCAGCCGAAT GGATGCTGAG CGCGGTGTCG CGCGTCGATC TGGTCAAGGA GCGGCCGCTG
TTTGGTCAAA GGATCGTGCC GCTGCCCGGC TCGATCCTGG CCTCGCCGGT CGATGCCGCC
TGGGACCCGG AGCCCGATTA TTTCTTCCAC TGGTTCCGCG ACGCGGCGGT GATCCTCGAC
GCCATCCGCG TGCTCGCCCT CGACGGGATC CTGAGCGCGG CACGCGCCAA GGCCCTGCTC
CACGACAGCC TGACCTTCGC CCAAACCGTC GCCAGGCTGC AAGGCGCGGC GCTGGCCGAT
GGCGCCTACC GGACGGCGAC CCGCCCCGAG GGGGTGCAAT ACCTGCGCGA TCCCGCCAGC
ATCGCCGGGA TCTGCGGCTT GCGCGTCGCC GCGGAAACCC GGGTCAATGC CGATGGCACC
TTCGATATCA CCACCTGGGC CCGGCCGCAG ACCGATGGCC CGGCGCTGCG CGCGCTCACC
CATCTGCGCT GGGATGCCCA AGGGACGGTC GACGAGGCCG ACCGTCCCTT GCTCCAGGCC
TTGATCGCCG CCGATCTCGC CGTGGTCGAG GCGCTGTGGG CCGAGCCGTC GTTCGACCCT
TGGGAGGAGG AATGCGGCAC CCATTACTAT ACCCGCCTGC TTCAGGCCGA AGCCCTGGAG
CGTGGCGCCG ATTGGCTCGC TGGCGGGGCG ACCGACGCCT CGCCAACCGA AGCCAAAACC
CAGGCCGACC GGCTGCGCCA GACCGCCCGC ACCATCCTCG ATCAACTTGA AAGCCATTGG
ACCGGCGACA TTCTGGTCTC GCGCCAGGGG ATCGAGGGCG GGGGCGATCC GGGAAAGCTC
CTGGATATCG CGGTGATCTT GGGGGTTGTC CACGCGGCGC GCGAGGGCGA TCGCCATGGC
CTGCTCGATC CGCGGATCGA AGCGACCTTC GCCGCCCTCG AAGCGCTGTT TCGCGCCGAT
TATCCGATCA ACCACGCCCT GCCCCCCGGG CGCGGGCCGG CTTTCGGGCG CTATCGCACC
GATGCCTATT TCAGCGGCGG GGCCTTCTAT TTCTCGACCT TCGGCGCGGC CGAATACCAT
TACCACCGCG CCCGACTGCG CAACGACCCG GCCAGTCTGG CCGCCGGCGA CGCCATTCTG
GCCACAACCC GCGCCTATGC GCCCGCCGAT GGCGACATGG CCGAACAATT CGACAAGACC
ACCGGCGCCC AATCCTCGGC CCGCACCCTG GCCTGGAGCC ACGCCGGGCT GATCACCGCC
GCCAGCGCCC GACGGCGGGC ACAAGACCAT CTGGGCGGCA TTCTCCAATA G
 
Protein sequence
MSEAPSATGL EDWLTTQIPR SAEWMLSAVS RVDLVKERPL FGQRIVPLPG SILASPVDAA 
WDPEPDYFFH WFRDAAVILD AIRVLALDGI LSAARAKALL HDSLTFAQTV ARLQGAALAD
GAYRTATRPE GVQYLRDPAS IAGICGLRVA AETRVNADGT FDITTWARPQ TDGPALRALT
HLRWDAQGTV DEADRPLLQA LIAADLAVVE ALWAEPSFDP WEEECGTHYY TRLLQAEALE
RGADWLAGGA TDASPTEAKT QADRLRQTAR TILDQLESHW TGDILVSRQG IEGGGDPGKL
LDIAVILGVV HAAREGDRHG LLDPRIEATF AALEALFRAD YPINHALPPG RGPAFGRYRT
DAYFSGGAFY FSTFGAAEYH YHRARLRNDP ASLAAGDAIL ATTRAYAPAD GDMAEQFDKT
TGAQSSARTL AWSHAGLITA ASARRRAQDH LGGILQ