Gene Rru_A1607 details

Gene Information

Locus tag: Rru_A1607
Symbol:
ID: 3835024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 1898895
End bp: 1901081
Gene Length: 2187 bp
Protein Length: 728 aa
Translation table: 11
GC content: 68%
IMG OID: 637825699
Product: 4-alpha-glucanotransferase
Protein accession: YP_426694
Protein GI: 83592942
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1640] 4-alpha-glucanotransferase
TIGRFAM ID: [TIGR00217] 4-alpha-glucanotransferase
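
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes 1-based, inclusive coordinates (an assumption, though the listed values are consistent with it) and an unspliced CDS that ends in a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS: codon count minus the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(1898895, 1901081)
print(length, protein_length(length))  # 2187 728
```

Both values agree with the Gene Length and Protein Length fields listed in the record.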


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGACG GCACCCCCGG AGTATTGAGG GTTCTTGCGG AACGGGCAGG GGTTATCGAC 
CGCTGTTATG ATGCTTTCGG ATCGTTGATG GTCGCCCCCG ACGCGGCGCT GAAGGCGGCT
TTGGCGGCTT TGGGCTGGCC GGCGGAGGAC GAGGCGCAGG CGGCGCGGTC GCTGGCCGCC
CTTGAGGCCG CCCCCTGGGC CGAGGCGCTG CCGCCTTGCG TCATCCATCG CCTGGGTGAC
ACGCCCGCCG TCCTGACGGT GCCGGTGGTG ACGACGGGCG ACTCGTTTGG CCCCGGCCGC
TGGAGTCTGA CCCCCGAGGA GGGAGCGCAA GCCCTACCGG CGACCGGGAC CTTCGATCCA
CAAAGCCTGA CGGTCGAGGA GTCGGGGCCG GAGGGTAAGA CACGGCGCCG CCTTGACCTC
GGCCTCGATC TGCCGATGGG GTATCATCGC CTGCGCGTCG AGGGACTTGC CCCGACGGCG
CTGGAGACCG TGCTGATCGT CGCGCCCAGG CGCTGTTGGC TGCCCGAGGG CTGGGAAGAC
CGCGAGGGGC CGCGTTTGTG GGGCGTGGCG ACCCAGGTTC ATGGCCTGCG CAGTCTCAAC
GACTGGGGAA TGGGCGATTT CGACTCGTTG GCCGTGCTGG CGCGGCTGAC CGGCCGCCAG
GGCGGTGATC TGTTGGGCGT CAGCCCGCTT CACGCGCTGT TTCCGGCCCG CCCGCTTCAT
GCCAGCCCCT ATTCGCCCAA CTCCCGGCTG TTCCTCAATC CGCTCTATAT CTCGATCCCC
AGCGTTCCGG AATACGACCT CAGCCCCGAG CTTGCCGCCT TCACCAACGA TCTCGGCGCC
CTTCAGCAGC CCGAACTGAT CGATCTCGCG GCGGTCGCCG AGGGCAAATG GTCGGCTTTG
GCCATTCTTC ACCGGCGTTT TGTCGCCCAG CACCTCGGAC GGCGGACGCC GCGCGGCCAA
GCCTTTACCG CCTTCCTGAA AGAGGGCGGG GAGCACCTTG AAAAATTTGC CATTTTCCAG
ACCCTGTCCG AGCACTTCAA GGAGAGGGGG TGGTCGTGGC GCGACTGGCC GGCTGAGTTT
CACGATCCGG CGGCGCCGGC GGTGGCCGCG TTCGCCGGCG AGCAGGCGGC CCGGGTGCTG
TTCCATCAAT GGTGCCAGTT CGAGGCCGAT CGCCAATTGG CGCTGGTGGC CAAGGCTGCG
GCCGATGCCG GTCAGCGCGT CGGGCTGTAC CGCGACCTCG CCCTGGGGTC CGATCCCACC
GGCGCCGATT GCTGGGCCCT GCGCGGGGTC ATGGCCGAGG GGTTGTCGGT GGGCGCCCCC
CCCGATCCGT GGAACGCCAA GGGCCAGAAC TGGGGATTCC CGCCCTTTCA CCCCGGCCAC
CTGCGCCAAG CCGCCTATCA GCCCTTCGCC CGGATGATCC GCGCCAATAT GCGCGAGGCC
GGGGCCCTGC GCATGGATCA TGTGCTGGGG CTGATGCGGC TGTTTTGCAT TCCCTGGGGG
ATGGAGGGCC GCGAGGGCTT CTATCTGAGG ATGCCCTTCG AGGATCTTCT GGCCGTTCTC
GCCCTGGAAA GCCACCGGGC GCGCTGCATC GTCGTTGGCG AAGATCTGGG CACCGTGCCC
GACGGCTTCC GCGATCGCAT GCGCGAGGCG GCGGTGCTGT CTTATCGGTT GTTCTTCTTT
GAACGGGGCC ACGATCAGGC GCTGACCCCC CCGGAAGCCT ATCCCCGCTT GGCCACCGTC
GCCGCGTCGA CCCATGATCT GCCGACCCTG GCCGGGTTCT GGGCCGGGCG CGATCTGGCG
TGGAAAAACG ACCTCGATCT GTTTCCCTCC GAGCAGGCGC GCAACGCCGA AAGCATGGAT
CGCCACAACG ATCGGCCGCG AATCCGCGCG ACTCTGGCGG CGGCGGGAAC GCCGCTGTCT
GACCCCGGGA CCTTCACGCC CCCCCCCGAA CTGGTGCCGG CGGTGACGGC GTGGCTGGCC
CGCACCCCCA GCCTTATTCT CATGCTTCAA ATTGAAGACG TGCTGGAAAT GCCCGAACAG
GCCAATATGC CGGGAACCAT CGATCAGCAT CCCAACTGGC GCCGACGAAT TCCGCTGGCG
GTGGAGGCTT TGGAGGTGGA CGGAAGGCTG GGCCGTCTGG CATGTATCCT GGCGTCCCAC
GGACGGGGGG CGGCAGCAAA AAGGTAA
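
The gene sequence above is printed in 10-base groups, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how a GC content figure could be recomputed from such a listing. The helper names are hypothetical, and the demo input is only the first line of the sequence, so its result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base groups."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a short demo input.
first_line = "ATGACGGACG GCACCCCCGG AGTATTGAGG GTTCTTGCGG AACGGGCAGG GGTTATCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` and then to `gc_percent` would give the figure to compare against the GC content field.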
 
Protein sequence
MTDGTPGVLR VLAERAGVID RCYDAFGSLM VAPDAALKAA LAALGWPAED EAQAARSLAA 
LEAAPWAEAL PPCVIHRLGD TPAVLTVPVV TTGDSFGPGR WSLTPEEGAQ ALPATGTFDP
QSLTVEESGP EGKTRRRLDL GLDLPMGYHR LRVEGLAPTA LETVLIVAPR RCWLPEGWED
REGPRLWGVA TQVHGLRSLN DWGMGDFDSL AVLARLTGRQ GGDLLGVSPL HALFPARPLH
ASPYSPNSRL FLNPLYISIP SVPEYDLSPE LAAFTNDLGA LQQPELIDLA AVAEGKWSAL
AILHRRFVAQ HLGRRTPRGQ AFTAFLKEGG EHLEKFAIFQ TLSEHFKERG WSWRDWPAEF
HDPAAPAVAA FAGEQAARVL FHQWCQFEAD RQLALVAKAA ADAGQRVGLY RDLALGSDPT
GADCWALRGV MAEGLSVGAP PDPWNAKGQN WGFPPFHPGH LRQAAYQPFA RMIRANMREA
GALRMDHVLG LMRLFCIPWG MEGREGFYLR MPFEDLLAVL ALESHRARCI VVGEDLGTVP
DGFRDRMREA AVLSYRLFFF ERGHDQALTP PEAYPRLATV AASTHDLPTL AGFWAGRDLA
WKNDLDLFPS EQARNAESMD RHNDRPRIRA TLAAAGTPLS DPGTFTPPPE LVPAVTAWLA
RTPSLILMLQ IEDVLEMPEQ ANMPGTIDQH PNWRRRIPLA VEALEVDGRL GRLACILASH
GRGAAAKR
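
The protein sequence corresponds to the gene sequence translated codon by codon under translation table 11. The sketch below illustrates this on the first 30 bases of the gene. The `translate` helper is hypothetical, the amino-acid string is the table 11 codon assignment in TCAG order, and the sketch does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a nucleotide listing, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop the 10-base grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGACGGACG GCACCCCCGG AGTATTGAGG"))  # MTDGTPGVLR
```

The output matches the first ten residues of the protein sequence listed above.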