Gene GSU2360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2360 
Symbol 
ID2686132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2582132 
End bp2585122 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content60% 
IMG OID637127051 
Productmaltooligosyl trehalose synthase 
Protein accessionNP_953407 
Protein GI39997456 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3280] Maltooligosyl trehalose synthase 
TIGRFAM ID[TIGR02401] malto-oligosyltrehalose synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACAAA ATGTTACCCT GGAGGCACGT GTCCCCACAG CAACCTATCG CCTGCAGTTC 
AACGGTGACT TCCGCTTCTG CGATGCGGCG CGGCTCGTCC CCTACCTGGA CGCCCTGGGG
GTGAGCGACA TCTACGCCTC TCCTTTCCTG AAGGCGCGTA CCGGAAGCAT GCACGGCTAC
GACATCGTCG ACCACAACCG CCTGAACCCC GAACTGGGAA GCCGGGACGA CTTCAATGCT
TACTGTGCGA TGCTACAGCG CCACAGCATG GGGCAGATCC TCGACTTCGT TCCCAACCAC
ATGTGTGTGG AAGGGGGAGA GAACGAACGT TGGCTCGATC TGCTGGAAAA CGGCCCCAGC
TCCGCTCACG GAGAATTTTT CGACGTAGAC TGGTCGCCGG TCAAGAAGGA ACTGACCGAC
AAGGTGCTGA TCCCGGTCCT GGGGGACCAG TACGGGACGA TCCTGGAAAA CGGGGAACTG
GTTCTTTCCT TCCAGGAAGG GGCATTTTTC ATCAGCTATT ACGAGCACCG CTTCCCCATC
ATCCCCAAGA CCTACAGCCC GATACTCACC CACCGGCTCC ACGAGCTGGA GCGGCTTTTC
CCACCTGACC ACGAGGCATA CCGGGAACTC CTGAGCATCG TCACGGCCAT CGACCATCTC
CCCTTCTACA CGGAGCGGGA TACGGAACGG GTCCGGGAAC GCTACCGGGA GAAGGAGGTA
ATCAAGCGCC GGCTGTGGAC CCTTTGCTCC GAAAACTGGC CTATCAAAGA GTTCATCGAT
GAAAACGTCC GGATTTTCAA TGGAGAGAAG GGGAATCCAC GCAGCTTTGA CCTTCTGGAC
GGGCTTTTGC GCCAGCAAGT CTATCGCTTG GCTCACTGGC GCACCGCCAC GGACGAAATC
AACTACCGCC GGTTCTTCGA CATCAACGCC CTGGGCGCCA TCCGGATGGA AACTCCCCGG
GTTTTCGAGG AAACCCACCG CCTGGTTATG GAATTGGTGA GCGAAGGCAC GGTAACCGGC
CTGCGGATCG ATCATGCAGA CGGGCTCTAC GATCCCACCG ATTATTTCAG ACGGCTCCAG
CGGGCCTGTT TCCTGGAAAC GCGCCTGGCA AGCCTCGGCG GGTCCGCTGG CGAGACTCCG
GAAACCCTGA GAGAGACGAT CCTGCAAATG TACGACGAAA TGTTGGAGGT TTCGCCCCAG
ACCATGCCCT TCTACATCGT CGGCGAAAAG ATTCTGATGA AGAACGAACG GCTCCCCGAA
GACTGGCCGA TCCATGGCAC TACCGGGTAC GAATTCGCTA ATGCCGTAAC CGGCCTCATG
GTTGATACCC GCAACGGACG GGAGTTCGAT GCGATGTACG CTCGGTTCAT CCAGGAACGG
CCGAATTTCG CCGAAATAAC CTACTTGAAG AAGAAACAGG TCATGCGGTT CTCCATGGGG
GGCGAAATCA ACACCCTAGG GCACTACCTG AACACCCTCT CGGAAAGCAA CCGCCACACC
CGCGACTTCA CCCTCGGCAG TCTCACCAGG GCGTTGATGG AAGTGATCGC CCACTTCCCC
GTCTACCGAA CCTACACGGC AACACGCAAG GTGGCGGACC GAGACCGCCA GTACATAGAG
TACGCCGTGG CCAAGGCTAA ACGACGCAAT CCCGCCATGA GCGAATCGAT CTTCACATTC
ATCGAAGATG TGCTCCTGCT CCGTTTTTAC GACAGCACCG GCGGGGAAGA GCAGAAGCAA
TGGCTCGATT TCGTCATGAA GTTCCAGCAG CTCACGGGAC CAGTCATGGC CAAGGGGCTG
GAGGACACTG CATTTTATGT GTTCAACCGG CTCGTCGCCC TCAACGAGGT GGGTGGGACG
CCCGAGCGGT TCGGGCTCAC CATGGAGGCG TTCCACGGCC AAAATATCGA GCGGGCACGC
AGCACCCCTT TCACCATGCT GGCCACATCG ACCCACGACA CCAAGCGGAG CGAGGATGTA
CGAGCCCGGA TCAGCGTCCT GTCCGAAGAT CCTACCTTCT GGCACGATTG TCTGATGCGC
TGGAGCCGGA TAAATCGAGG CCACAAAGTG ATCGTGCAGG GGGTTAAAGT TCCGGACCGG
AACGAGGAGT ACCTCCTCTA CCAAACCATC GTGGGCGCTT GGCCCGCTGA GGAATTCACG
GCCGATGGGC ACGGCGCGTT CGTTGACCGG GTCAGGCAGT ACATGCTCAA GGCCATGCGG
GAAGCCAAGG TGAACACCAG CTGGATCAAC CCCAACCCGG TCCACGAGGA GGCGGTCCAC
CACTTCGTCG ATGCCATTCT CCGGAACGTA CCGACCAACG GGTTCCTTGC CGATCTGCGC
CGGACACTTC CCCCCCTTGT CCGCTGCGGC ATGCTGAATT CCCTTTCCCA GACCCTGCTC
AAGGCGGCGT CCCCCGGCAT CCCAGACTTC TACCAAGGGA CGGAACTCTG GGATTTCAGC
CTGGTCGACC CGGACAACCG GCGACCGGTG GACTTTGACA AGCGGTCGGT TATGCTGGAA
GGATTGCGAT TGGCTGAGCA GGAACGGGGC ACCCTCGCCC TGGCCCGGGA ACTTCTCGCC
GACATGGCTG ACGGCAGAGT CAAACTCTTT CTGGTCTGGA AGACCCTCTG CTTCCGCAGG
GATCACCGGA GCCTCTTCGA AGCGGGCAAG TATCTTCCCT TGGAGGTTCA GGGCGAACGC
AGCGACAACG TCTGCGCCTT CGAACGGTAC AACGACGGAG AGTCCGTCAT TGCCGTGGCA
CCCCGCTTCT TCAGCCGATT GGGGGCAGTT CCTGCGGGTG GCGAAACCTG GCAGGACTCC
CGGATCGTTA TTCCGTTCGA AAGTGCCGGC TGCGCCTATC GCAACATCTT CACCGGTAAA
CGGGTCGTGA CCTCGCCGCG GGAAGGGCAG ACGATCCTGC CCCTTGCCGA CGTGCTGGCC
GACTTCCCCG TAGCACTGCT TGAGGCAGTG AGAGAAGAGC TTCCACGCTG A
 
Protein sequence
MQQNVTLEAR VPTATYRLQF NGDFRFCDAA RLVPYLDALG VSDIYASPFL KARTGSMHGY 
DIVDHNRLNP ELGSRDDFNA YCAMLQRHSM GQILDFVPNH MCVEGGENER WLDLLENGPS
SAHGEFFDVD WSPVKKELTD KVLIPVLGDQ YGTILENGEL VLSFQEGAFF ISYYEHRFPI
IPKTYSPILT HRLHELERLF PPDHEAYREL LSIVTAIDHL PFYTERDTER VRERYREKEV
IKRRLWTLCS ENWPIKEFID ENVRIFNGEK GNPRSFDLLD GLLRQQVYRL AHWRTATDEI
NYRRFFDINA LGAIRMETPR VFEETHRLVM ELVSEGTVTG LRIDHADGLY DPTDYFRRLQ
RACFLETRLA SLGGSAGETP ETLRETILQM YDEMLEVSPQ TMPFYIVGEK ILMKNERLPE
DWPIHGTTGY EFANAVTGLM VDTRNGREFD AMYARFIQER PNFAEITYLK KKQVMRFSMG
GEINTLGHYL NTLSESNRHT RDFTLGSLTR ALMEVIAHFP VYRTYTATRK VADRDRQYIE
YAVAKAKRRN PAMSESIFTF IEDVLLLRFY DSTGGEEQKQ WLDFVMKFQQ LTGPVMAKGL
EDTAFYVFNR LVALNEVGGT PERFGLTMEA FHGQNIERAR STPFTMLATS THDTKRSEDV
RARISVLSED PTFWHDCLMR WSRINRGHKV IVQGVKVPDR NEEYLLYQTI VGAWPAEEFT
ADGHGAFVDR VRQYMLKAMR EAKVNTSWIN PNPVHEEAVH HFVDAILRNV PTNGFLADLR
RTLPPLVRCG MLNSLSQTLL KAASPGIPDF YQGTELWDFS LVDPDNRRPV DFDKRSVMLE
GLRLAEQERG TLALARELLA DMADGRVKLF LVWKTLCFRR DHRSLFEAGK YLPLEVQGER
SDNVCAFERY NDGESVIAVA PRFFSRLGAV PAGGETWQDS RIVIPFESAG CAYRNIFTGK
RVVTSPREGQ TILPLADVLA DFPVALLEAV REELPR