Gene Sama_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_2454 
Symbol 
ID4604703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2959842 
End bp2962073 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content59% 
IMG OID639781851 
Product4-alpha-glucanotransferase 
Protein accessionYP_928328 
Protein GI119775588 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTGG AAAAGCTCTT TTATCTGCAA GGTGTGGGAG ACAGGTTCAT CGACTGCGAT 
GGCCGAGAAC AAGCCATACC CGAATCCGCT CGGTTGGCAA TGCTCAAATC CCTGATGGGC
CTCGAACAGT CCCCCGACGG GGCCGAAATA GCCGCCAGGG TGGACACCCT CGACAGCCTC
CCTTGGACTA AGTTGCTGCT GCCGCTGCAA TGGTGTTTTG AAACCCGTCC GTCAGTTGAA
TGCCAGCTGC CTGAAAACCT GAAGCAAGAC ATGGAATTAA CCCTGATTTC AGAGCAGGGG
GAGCGTGTAA CGCTGACGCT TTTGGCAAGG GATGCCGAGA CGACCGGTGA TTATCAAAGG
CCCGATGGCA GATACCTTCG CTGTCGCTAT TCCCTGCCAA CGCTGGCCAT GGGGGAGTTT
CAGCTGCAGC TGTCCCACCC TGTCCTCGGC CAGGGAAAGG GAGCCTTGCT GATTATCCCC
GAGCAGGCCT ACGGTGGCCC ACCGGGCCTT GGCAAGCGTC CCTGGGGCCT GGGGATAAGC
CTGTTTACCC TGAGAAGCCA GCGGCAGTGG GGCATCGGTG ACCTGGCCGA CCTGCAAACC
CTGATAGAAC TCTCCTCAGA AGTCGGGTGC GATTTTATCA CCCTGACGCC CTTGCATGCC
CCGGACATTG CCAACCCCGG ACTCGCAAGC CCCTACAGCC CCTGGGACAG GCGCTTTCTC
AATCCACTCT ATATCGCCAT CGACTTGGTG ACCGAATACA GCCAATTGGC GCAGGAGTTC
AATGGCGGCG ACTGGCGCCG GGAACGCCTT ATTCTCAACC AGGCAAGCCA GATTGATTAC
CCCAAACTTG CGCTGCTGAA GTATCGGGCG CTGGCAAAGT TGTTCGCTGC TTTTGGAAAG
CTGGATGAAT TCAGCGGGCG GCGAGTGCGC TTCGAACGTT TCGTGGCCGA GGGCGGCACG
GCGCTGAAGG ATTTTTGCCG TGATGTAAGC CAAAGGGCAT TAAGCCTGGA GGGGGAATGG
CATCGGGACC CAGCGCTGGC AGAAGCGCTG CAACGGGACG AGTTTCATGC CTGGCTGCAG
TTTGTGGCCG ATGAGCAGTT GTCGCTGTGT CAGCTTCGCT GCCGTCAGGT GGGCATGTCT
CTTGGTTTGG TGAGGGATTT GGCCGTGGGC GCCCTGGCTA TCGGCAGTGA AGTGAGCCGC
AGTGGCGTAT TTTGCCTTAA TGCCGATATC GGGGCGCCAC CGGACCCCTT CGCCCGTCAG
GGGCAAAACT GGGGCATGCC GCCGATGGAC CCTGTGGCGT TGAAAGACTC GGCCTTAAGC
CATCTTAAAT CCCTTTATCG AAGCAACATG CAAAGCTGTG GCGCGCTCAG AATCGACCAT
GTGATGGCCC TGACCCGGTT GTGGTGCTGG CCAGAAGGCG ATGACAACGG TTGTTACCTC
TACTATCCCC AGGAACTGAT GCTGGCGGTC TTGTGCCTTG AGAGCCACAA AAACCGCTGC
CTCCTGATTG GGGAGGATCT CGGCACAGTG CCGCCGCTGC TCAAGGGCTT GCTGCGGGAG
CGGGGCATTC TCGGGAACGA CGTATTTTAC TTTTGCCGGG ATGGCAGCGG ATTTTCCGCG
CCTCGGGAGC ACAGAGCCGG TGCCATGTTG CAGCTTGGCA ATCAGGATGT GCCGCCATTC
ATGGCCTGGT GGCGCGGCAT TGATCTCGGC TTACTCAATC AGCTTGGGCT TTTGCCGTCT
GTGGAAGAGG CGCACTTGAG CAGAGTCGCA CAGTGCCGGG GTTTGCTCGA AAGCCTGGTG
GAAGCCGGAT GGCTGCGCCG GGAGGCCCTG ACCTGGCAGC CCTGTGATGC ATTACTGCCA
GCGCCAGCGG GCAATATATC GGCAGGCACT ACATCGGCAG GTAACGCCAC CGACAGTATT
CGTCAGGGGA CGGCGGTTTT TGATGCCTTA CTCAACTGGC TGCCCGGCAG CCGCGCCAAA
TTGTTTTCTG TCAGCCTTTG GGATCTGGCC CTTGAATCCC AGCCCATCAA TATTCCCGGC
ACCAGTTTTG AGTACCCCAA TTGGCGTGCA CGCATGAGTC AGTCATTGGA GAGCCTTTGG
CAAAGCCGCG AGTTCAGAGC GCGACTGGAA GCGATTCGCG CCGGGCGCAG TCAGCCTCAA
GGTGGGCGCC CAGGCCCGTC TGTCATAGGC GAAGCCATGG CATCTGCCGA TGACAGAGAG
CGCCAAAACT GA
 
Protein sequence
MGLEKLFYLQ GVGDRFIDCD GREQAIPESA RLAMLKSLMG LEQSPDGAEI AARVDTLDSL 
PWTKLLLPLQ WCFETRPSVE CQLPENLKQD MELTLISEQG ERVTLTLLAR DAETTGDYQR
PDGRYLRCRY SLPTLAMGEF QLQLSHPVLG QGKGALLIIP EQAYGGPPGL GKRPWGLGIS
LFTLRSQRQW GIGDLADLQT LIELSSEVGC DFITLTPLHA PDIANPGLAS PYSPWDRRFL
NPLYIAIDLV TEYSQLAQEF NGGDWRRERL ILNQASQIDY PKLALLKYRA LAKLFAAFGK
LDEFSGRRVR FERFVAEGGT ALKDFCRDVS QRALSLEGEW HRDPALAEAL QRDEFHAWLQ
FVADEQLSLC QLRCRQVGMS LGLVRDLAVG ALAIGSEVSR SGVFCLNADI GAPPDPFARQ
GQNWGMPPMD PVALKDSALS HLKSLYRSNM QSCGALRIDH VMALTRLWCW PEGDDNGCYL
YYPQELMLAV LCLESHKNRC LLIGEDLGTV PPLLKGLLRE RGILGNDVFY FCRDGSGFSA
PREHRAGAML QLGNQDVPPF MAWWRGIDLG LLNQLGLLPS VEEAHLSRVA QCRGLLESLV
EAGWLRREAL TWQPCDALLP APAGNISAGT TSAGNATDSI RQGTAVFDAL LNWLPGSRAK
LFSVSLWDLA LESQPINIPG TSFEYPNWRA RMSQSLESLW QSREFRARLE AIRAGRSQPQ
GGRPGPSVIG EAMASADDRE RQN