Gene Rru_A1605 details

Gene Information

Locus tag: Rru_A1605
Symbol:
ID: 3835022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 1893256
End bp: 1896558
Gene Length: 3303 bp
Protein Length: 1100 aa
Translation table: 11
GC content: 63%
IMG OID: 637825697
Product: Alpha amylase, catalytic region
Protein accession: YP_426692
Protein GI: 83592940
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
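
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


gene_len = span_length(1893256, 1896558)
print(gene_len)                  # 3303
print(protein_length(gene_len))  # 1100
```

Both values match the "Gene Length" and "Protein Length" fields of the record.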


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGACT TCCTATGGTA CAAGGACGCC ATCGTCTATC AGCTTCATGT AAAAGCCTTT 
TTTGATTCGA ACGGTGACGG GTTCGGCGAT TTCAAGGGGC TGACCGAAAA GCTGGATTAC
CTCTCCGATC TTGGGGTGAC GGCGGTGTGG ATCTTGCCGT TCTATCCCTC GCCGCTGCGC
GATGACGGCT ATGACATCGC CGATTACAAG GCCATCCACC GCGCCTATGG CACCATGGGC
GATTTCCGCC GCTTCGTGCG CGAGGCCCAT GAGCGCGGGT TGAAGGTCAT CACCGAACTG
GTGATCAATC ACACCAGCGA TCAGCACGCC TGGTTCCAGC GCGCCCGCGA AGCCAAGCCC
GGCAGCAAGG CCCGCGACAT GTACGTGTGG TCCGACACCG ACCAGCGCTA TTTGGATACG
CGCATCATTT TCACCGATAC CGAAACCTCG AACTGGGCCT GGGATCCGCT GGCCAAGGCC
TATTACTGGC ACCGCTTCTT CGCCCATCAG CCCGATCTCA ACTGGGACAA TCCGCGAATT
TTCGAGGAGA TCACCCGGGT GATGAAGTTC TGGCTTGATT GCGGCGTCGA TGGCATGCGC
CTGGACGCCA TTCCTTATTT GGTCGAACGC GACGGCACGA ATAACGAAAA CCTGCCCGAA
ACCCATGTCG TCCTCAAGCG GCTGCGCGCT TGGCTTGATG CCCATTATGA AGACCGGATG
TTCCTGGCCG AGGCCAATCA ATGGCCCGAG GACGTGCGCG AGTATTTCGG CGAAGCCGGC
GACGAATGCC ACATGGCCTT CCACTTTCCG GTGATGCCGC GCATCTATAT GGCCGTGGCC
CAGGAAGACC GCCATCCGAT CACCGATATC ATGCGCCAGA CCCCGGAGAT CCCAGCGGCT
TGCCAATGGG CGATCTTCCT GCGCAATCAC GATGAACTGA CGCTCGAAAT GGTCACCGAC
CGCGAGCGCG ACTATATGAA CACCTTCTAT GCCAATGATT CGCGGGCCCG AATCAACGTC
GGCATCCGTC GGCGCCTGGC GCCTTTGCTT GATAACGATC GCCGCAAGAT CGAATTGCTC
AACAGCCTTT TGATGTCGAT GCCGGGTACG CCGATCATCT ATTACGGCGA CGAGATCGGC
ATGGGCGACA ATATTTTCCT GGGCGATCGC GATGGCGTGC GCACGCCCAT GCAATGGTCG
CCCGACCGCA ACGGCGGCTT TTCGCGGGCC GATCCCGCCT CGCTTTATCT GCCGACGATC
ATGGATGCGG TCTATGGCTT CTTCGCCGTC AATGTCGAGG CCCAGTCGCG CTCGCCATCC
TCGCTGCTCA ACTGGATGCG CCGGCTGATC GCCGTGCGCA AGCGCCACCC CTCCTTCGGG
CGCGGCACCT TGCGCTTTCT CTATCCCGGC AACCGCAAGA TCCTGGCTTA CTTGCGGGAA
TTCGAGGGCG AGGTGATCTT GTGCGTGGCC AATCTGTCGC GTGCCCCGCA GCCGGCCGAA
TTGGGGCTGG CCGAGTTTTC CGGGCGGGTT CCGGTCGAGA TGCTGGGCAA CAGCGTCTTT
CCGCCGATCG GCGAGTTGCA CTATTTCATC ACCCTGCCGG CCTATGGCTT CTATTGGTTC
CGCCTGTCGA AGGGCGAGGG GCCGAGCTGG CATGAACCGC CGGTCGAGCC CTTGCCCGAT
CTTTCGACCC TGGTTCTTCC CCGTATCTGG GACAGCCTGG GCAGCGATGG GCCCTTGCGC
CAGATCCAGG GGGATATCCT GCCGGCCTTC CTGCCCAAGC AACGCTGGTT CTCTGGCAAG
GAGCATTCCC TGCGCGGCGT CGAGATGACC GACCACGCGG TGATTTCGGC CCCCGGCGCC
GCTTCGGGCT GGATGCTCGC CCTGTGGCGC GCCGATTACA AGGATGGCGC CCCGGGGCAA
ACCTTCCAAT TGCCCCTTGA TATCGCCTGG GAAAGTCGCG AGGACGATCC CTTGTCCCGG
CTGCTGCCCT TTACCTTGGC CCGGGTGCGG CGGGTCAACC GCGTTGGCGC CCTCTATGAC
GCCATGGCCG GCCCGGCTTT CCCGCTCGCC TTGGTGCAGG CGATGGCCGA AGGCGCGAAC
GTGACGACGG CCAAGGGCGG CGGTTTGCGC TTCACCCGGA CCCAGGCCTT CCCCGCCGGC
CCGCTTCCCG GCCCCGAAGC GGTGGCCCGC CTGGGGCGTG AGCAATCCAA CACCTCGATC
AAACTTGGCG AGGATATGGT TCTCAAGCTC TATCGCCGGG TCGAGGCCGG GATTCATCCA
GAAATCGAAA TCGGCCGCTT CCTGACCGAT GTCGCCGGCT TCGCCAACGC CCCGCCGCTG
CTTGGCGCGC TCGAGCATAT CGATGCTAAG GGCGCCATCT CGGCCCTGGC GGTGCTGCAG
GGCTTCGTGC GCAATCAGGG CGATGGCTGG GACTACATGC TGGCCTATCT CGACCGCTTC
CTCGACGACA GGGCCCAGGC GCCGGCCGAG GTCGGCGAGG GCGGCGGGCC GGCCGGCTCG
CCCCATGGCA TCATCCACGC CCAGGCCGAT ATCCTTGGCC AGCGCGTCGC CGAGCTTCAC
CACGCCTTCG CCACCCCGAC CGAGGACCCG GCCTTTTCCG CCGAAGCCGT CGGGCCGGAG
GATCTGGCGA GCTGGCGCGA GCAAGTGCGC ACCCAGGCGG CCCAGGCCCG TCAGGCCCTT
GACACCGCCT TGCCCGGGCT GTCCGCAGAG GTGGCGGGGC TGGTCACCCG ACTGCTGGAC
TGCTGGGAGC GGATCGACGC CCGCGTCGAC GCGCCGATTT CTTTGGCCGA GGGGCTGGTC
AAAACCCGCA TCCATGGCGA CCTTCATCTG GGGCAGGTGG TGGTGGTGCG CGATGACTTC
CATATTCTCG ATTTCGAGGG CGAGCCGGTG AAGGGACTGA GTGAGCGGCG CGACAAACAC
TGCCCGTTGA AAGACGTGGC CGGCATGCTT CGCTCGTTCG ATTACGCGGC GTGGTCGGCG
GCGCTGAGCT TCCGCCAAAC CCATGCCGAG GTGCGGATCG ACGTACTGCC CGCGCTTGGC
GTTTGGCAGG AGGAAATCCG CGAGGCTTTC CTCGCCGGCT ATGACCGGGC CATCGCCGGA
TGTCCCTCGG TTCCGGCCCT TGAGGAGGGG CGAAGGGACC TCCTTTCACT ATTCATGCTG
GAAAAAGCCC TCTATGAGGT ATCTTACGAA GCCGCCAATC GTCCGGACTG GTTACGAATT
CCCATCGGCG GCGTTTTGCG CCTGCTTGGA GAGGGGCCAT CGGACGAGAT CAAACCGCGT
TGA
 
Protein sequence
MSDFLWYKDA IVYQLHVKAF FDSNGDGFGD FKGLTEKLDY LSDLGVTAVW ILPFYPSPLR 
DDGYDIADYK AIHRAYGTMG DFRRFVREAH ERGLKVITEL VINHTSDQHA WFQRAREAKP
GSKARDMYVW SDTDQRYLDT RIIFTDTETS NWAWDPLAKA YYWHRFFAHQ PDLNWDNPRI
FEEITRVMKF WLDCGVDGMR LDAIPYLVER DGTNNENLPE THVVLKRLRA WLDAHYEDRM
FLAEANQWPE DVREYFGEAG DECHMAFHFP VMPRIYMAVA QEDRHPITDI MRQTPEIPAA
CQWAIFLRNH DELTLEMVTD RERDYMNTFY ANDSRARINV GIRRRLAPLL DNDRRKIELL
NSLLMSMPGT PIIYYGDEIG MGDNIFLGDR DGVRTPMQWS PDRNGGFSRA DPASLYLPTI
MDAVYGFFAV NVEAQSRSPS SLLNWMRRLI AVRKRHPSFG RGTLRFLYPG NRKILAYLRE
FEGEVILCVA NLSRAPQPAE LGLAEFSGRV PVEMLGNSVF PPIGELHYFI TLPAYGFYWF
RLSKGEGPSW HEPPVEPLPD LSTLVLPRIW DSLGSDGPLR QIQGDILPAF LPKQRWFSGK
EHSLRGVEMT DHAVISAPGA ASGWMLALWR ADYKDGAPGQ TFQLPLDIAW ESREDDPLSR
LLPFTLARVR RVNRVGALYD AMAGPAFPLA LVQAMAEGAN VTTAKGGGLR FTRTQAFPAG
PLPGPEAVAR LGREQSNTSI KLGEDMVLKL YRRVEAGIHP EIEIGRFLTD VAGFANAPPL
LGALEHIDAK GAISALAVLQ GFVRNQGDGW DYMLAYLDRF LDDRAQAPAE VGEGGGPAGS
PHGIIHAQAD ILGQRVAELH HAFATPTEDP AFSAEAVGPE DLASWREQVR TQAAQARQAL
DTALPGLSAE VAGLVTRLLD CWERIDARVD APISLAEGLV KTRIHGDLHL GQVVVVRDDF
HILDFEGEPV KGLSERRDKH CPLKDVAGML RSFDYAAWSA ALSFRQTHAE VRIDVLPALG
VWQEEIREAF LAGYDRAIAG CPSVPALEEG RRDLLSLFML EKALYEVSYE AANRPDWLRI
PIGGVLRLLG EGPSDEIKPR
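
The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any analysis. The following is a minimal Python sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-letter blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence in this record, used as a small example.
first_line = "ATGAGCGACT TCCTATGGTA CAAGGACGCC ATCGTCTATC AGCTTCATGT AAAAGCCTTT"
dna = clean_sequence(first_line)
print(len(dna))               # 60
print(dna.startswith("ATG"))  # True
```

Running `gc_fraction` on the full cleaned gene sequence should give a value that can be compared with the "GC content" field of the record.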