Gene GSU2361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2361 
Symbol 
ID2685758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2585158 
End bp2588493 
Gene Length3336 bp 
Protein Length1111 aa 
Translation table11 
GC content65% 
IMG OID637127052 
Productalpha amylase family protein 
Protein accessionNP_953408 
Protein GI39997457 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.124946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCGCA GGACACCGCT GTCGGACGAC ACCCCCCTCT GGTACCGGGA CGCCGTCATC 
TACCAACTCC ACGTGAAGGC CTTCGCCGAT TCCGACGGCG ACGGCGTGGG GGACTTCCGG
GGCCTCATGG GGAAGCTCGA TTATCTCCAG TCCCTGGGGA TCACCGCCAT CTGGATACTT
CCCTTCTACC CGTCCCCCTT GCGCGACGAC GGCTACGACA TTGCCGACTA CTACAACGTC
AACCCCAGCT ACAACACCCT GCGCGAATTC CGGGAATTCC TCAGGGAGGC CCATGCCCGC
CGCATCCGGG TCATCACCGA GCTGGTCCTC AACCACACCT CGGACCAGCA CCCCTGGTTC
CAGCGTGCCC GCCGGGCAAA ACCGGGCTCG GTCCACCGGG ACTACTACGT CTGGAGCGAT
ACCCCCGACC GGTACCGGGA AACCCGGATC ATCTTCCAGG ACTTCGAGAC GTCCAACTGG
AGTTGGGACC CGGTGGCCAA GGCCTACTAC TGGCACCGGT TCTATTCCCA CCAGCCCGAC
CTCAATTTCG ACAACCCCCG GGTCCAGTCG GAAGTGCTCC GGATCATCGA CTACTGGCTG
GGGATGGGTG TGGACGGCGT CCGGCTGGAT GCCGTCCCCT ACCTCTTCGA GCGGGAAGGA
ACCAACTGCG AAAACCTGCC CGAAACCCAT GACTTCCTGA AAAGGCTCCG GGGACACATG
GACGCCTCGT TCCCCAACCG GATGCTGCTG GGCGAGGCCA ATCAATGGCC CGAGGACGCG
GTGGCCTATT TCGGCGAGGG GGACGAGTGC AACATGCTCT TCCACTTCCC GCTGATGCCC
CGCATGTACA TGGCCATCGA GATGGAGGAC CGCTTCCCGG TGGTCGACAT CCTCGACCAG
ACCCCCGCGA TCCCCGAGGG ATGCCAGTGG GCCATCTTTC TCCGCAACCA CGACGAGCTG
ACCCTGGAGA TGGTGACCGA CGAGGAGCGG GACTACATGT ACCGGATCTA CGCCTCGGAC
CCCAAGGCCA GGATCAACCT GGGAATCCGG CGGCGGCTGG CGCCCCTCCT CGGCAAGGAC
CGGCGGCGGA TCGAACTCAT GAACGCCCTG CTCTTCTCCC TGCCCGGCAC CCCCGTCATC
TACTACGGCG ATGAGATCGG CATGGGGGAC AACTACTACC TGGGGGACCG CAACGGCGTC
CGTACCCCCA TGCAGTGGAG CCCCGACCGC AATGCCGGTT TTTCCGGCGC CAACCCCCAG
CGCCTCTTCC TGCCGGTCAT CATCGATCCC GAATACCACT ACGAAGCGGT AAACGTGGAC
ATCCAGGAGC GCAACCCCAC CTCGCTGCTC TGGTGGATGC GGCGCATCAT CGCCGTGCGC
CGCCGCTACC GGGCCTTCAG CCGCGGCGCC ATGGAGATGC TCTATCCGGC CAACCACAAG
GTCCTGGCCT TTCTCCGCCG CCACGAGGAT GAAGTGATCC TCGTGGTGGT CAACCTCTCC
CGCTTCGCCC AGGCCATCAA CCTGGACCTT CAGGAATTCG CCGGCATATC CCCCGAAGAC
CTCTTCAGCC GGAATCGCTT CCCGGTCATC CGCGAAGCCA CCTATCCCCT CACCCTGGGC
CCCCACGACC ACTTCTGGTT CCTGCTGCGG CGCAGGGAAG CGCCGCTTCA GGCCGAGGAA
GAACTGCCGC GGATCAGGCT TCGGGGAGAA CTCCCCTGGT GGGAGACTCT CCGCCAGACC
AGGGGCGACG CTCACCTGGA ACGGGTGACC ACCGATTACC TCAAGCGAAG CGGTTGGTTC
CGGGGAAAAG CGAGGGCAAT CATCGGCTAC GTGCTGCGCG ACACCGTGCT CCTGCGGCAG
GGGGAACGGG TGTTCCCCAT CTTCTTCCTT GAGGTACGCT ACCAGGACGG CCCGCCGGAA
ACCTACCTGA TTCCCGTGGC GTTCCTCACC GGTGCCGGAG CGAAACGGGT AAGCGCCGAA
TCCCCCCGGG CGGCCATAGC CGCTCTTTCC GTGGGGGACG AAGAGGGCAT TCTCTGCGAT
GCGGTCCATG ACCGGGAATT CAGGGACGCG CTCCTTGCCC TGGCACTCGG CCGTCGCCGG
CTCCATGGCG AGGAAAACAG TCTCATCGTG CCGGTTCACG CCGGAGCAGG ACGCATCGGC
TCCAGGGAAG AGATGCGTCA TCTCGCCTCC GAACTCCCCA AGGCGGAACA GACCAACAGC
ATCATCACCT ACGGCGATCG CCACGTGCTC AAGCTCTACC GCAAGGTGGA GCCGGGCATC
AATTCCGAGG TGGAGATGGC GCGGTTCCTC ACCGCAAAAT CGAGCTACCC CAACGTTGCC
CCCTTCCAGG GGAGCCTGGA ATTGCGGCAT CCCGGCGTCG AGCCGGGCGC TATCGGCCTG
CTCCAGGGAT ACGTACAGAA CCAGGGCGAC GGCTGGCGAC TCTCCCTCCA CCTCCTGGGC
CAGTACTTCG AGCGGATTCT CTCGTGCCGG GGCGAGCTGC CACCGCCGCC GTCGCGTTTT
TCGAGCCTCA TGGACGGCAG CGCCTGCACC GTGCCGGAAC CGGCGGCGGA GCTCATCGGC
GGCTTTTATT TGGAGATGGC TGGACTTCTC GGCCGCCGGA CCGCCGAGCT CCACTTGGCC
CTCGCCGCCG GGGGAAACGA CCCGGCTTGG CGGCCCGAGG AGTATTCCAC CCTCTACCAG
CGCTCCGTGT ACCAGTCCAT GCGCAACCAG GCCCGACGCA GCCTCCAGCT CCTGGCCCAG
CACCACAAGG ACCTGCCCGC AGAGGCGTTG GCATCGGCCG ACAGGATTCT GAGCGCGGAA
AAGGAACTTC TCGCCTGTCT GCGTCCCATC GTCGGCCGGC GGATTCAGGC CATGAAGAGC
CGCATCCACG GCAACTTCAG GTTGGAAAAG GTTCTCTTCA CCGGCAAGGA CTTCATGATA
ATCGACCTGG AGGGCGAGCC GGACCGGCCC CTCGGCGAGC GCCGCATCAA ACGCTCCCCC
CTGCGGGATG TGGCGGGAAT GATCCGCTCC TTCCACAACG CCACCCTGAC GGCCCTTGCC
CGCCACGGCG CTGGCCATCC CGGCGACATC CCGCTCCTGG AGCCCTGGGC GGAGGCCTGC
TGGTATCACG TGAGCTGCCG CTATCTGGCC GGCTACCTGG AGCAGATGGG CACAAGCCCC
CTCGTCCCGA CGGAGCGGAG CGACCTCGAG ACCCTCCTCC GCTCGTTCCT GCTGGACAAT
GCCCTCCACG AACTCGGCTA TGCCCTCGCC AACCGCCCCG AACGGGTTTC CTCTTACCTG
CGCGGGGTCG AGACGGTGCT GCGGGAGTTC CGGTAA
 
Protein sequence
MARRTPLSDD TPLWYRDAVI YQLHVKAFAD SDGDGVGDFR GLMGKLDYLQ SLGITAIWIL 
PFYPSPLRDD GYDIADYYNV NPSYNTLREF REFLREAHAR RIRVITELVL NHTSDQHPWF
QRARRAKPGS VHRDYYVWSD TPDRYRETRI IFQDFETSNW SWDPVAKAYY WHRFYSHQPD
LNFDNPRVQS EVLRIIDYWL GMGVDGVRLD AVPYLFEREG TNCENLPETH DFLKRLRGHM
DASFPNRMLL GEANQWPEDA VAYFGEGDEC NMLFHFPLMP RMYMAIEMED RFPVVDILDQ
TPAIPEGCQW AIFLRNHDEL TLEMVTDEER DYMYRIYASD PKARINLGIR RRLAPLLGKD
RRRIELMNAL LFSLPGTPVI YYGDEIGMGD NYYLGDRNGV RTPMQWSPDR NAGFSGANPQ
RLFLPVIIDP EYHYEAVNVD IQERNPTSLL WWMRRIIAVR RRYRAFSRGA MEMLYPANHK
VLAFLRRHED EVILVVVNLS RFAQAINLDL QEFAGISPED LFSRNRFPVI REATYPLTLG
PHDHFWFLLR RREAPLQAEE ELPRIRLRGE LPWWETLRQT RGDAHLERVT TDYLKRSGWF
RGKARAIIGY VLRDTVLLRQ GERVFPIFFL EVRYQDGPPE TYLIPVAFLT GAGAKRVSAE
SPRAAIAALS VGDEEGILCD AVHDREFRDA LLALALGRRR LHGEENSLIV PVHAGAGRIG
SREEMRHLAS ELPKAEQTNS IITYGDRHVL KLYRKVEPGI NSEVEMARFL TAKSSYPNVA
PFQGSLELRH PGVEPGAIGL LQGYVQNQGD GWRLSLHLLG QYFERILSCR GELPPPPSRF
SSLMDGSACT VPEPAAELIG GFYLEMAGLL GRRTAELHLA LAAGGNDPAW RPEEYSTLYQ
RSVYQSMRNQ ARRSLQLLAQ HHKDLPAEAL ASADRILSAE KELLACLRPI VGRRIQAMKS
RIHGNFRLEK VLFTGKDFMI IDLEGEPDRP LGERRIKRSP LRDVAGMIRS FHNATLTALA
RHGAGHPGDI PLLEPWAEAC WYHVSCRYLA GYLEQMGTSP LVPTERSDLE TLLRSFLLDN
ALHELGYALA NRPERVSSYL RGVETVLREF R