Gene TM1040_3867 details

Gene Information

Locus tag: TM1040_3867
Symbol:
ID: 4074930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008042
Strand:
Start bp: 119769
End bp: 122123
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 56%
IMG OID: 638004524
Product: Alpha-glucosidase
Protein accession: YP_611259
Protein GI: 99078000
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
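
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop
    codon (if present) does not contribute a residue.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length = gene_length_bp(119769, 122123)
print(length)                     # 2355
print(protein_length_aa(length))  # 784
```

Both results agree with the Gene Length and Protein Length fields reported in the record.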


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.191857
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTGTC TTAAAATCTG GGCTTTGGAA GCGGAAACCA CCACAGGTGT GATTTTGCGC 
GTCGAAGGCC GCCACCTACT GCATATATCT GTACTGGAGG AAAACAGGTT TCGCGTATCT
CTGCAAAAAG ACGCAGAGTG GCGCTTGGAG CGAACATGGA CTGTGGCGCC TGCCGGGGAT
GCCCCATGGG AGGGCCGTCG GCGCGAAGAC ATCTCTGGCT TTTCCTGCCC GCCCCCGCAG
CTCTCGCAAA ACGAGACTGA ACTCGTTTTG TCGACTGAGA CAATGCGGCT TCGTATTTTG
GCACCGCTAC AGATGGTGTG GGAGGCCAAG GTGAACGGAG CATGGAAAGC CTTCGCCGAG
GATCGCCCAA CCGGAGGCAT TCACTTGGGC CTGCGGGATC ATGCCCATGC GCATTTTCTA
TCCCGACACC CGCAAGAACC GGTTTTTGGA CTTGGCGAAA AGACAGGCCC GCTTAACCGT
GTTGGTCAGC GCTACGAAAT GCGCAACTTA GATGCGATGG GTTACGATGC TGAGCGCACT
GACCCACTCT ACAAGCACGT GCCCTTCACT CTGACGCGGA CACAAACTGC CGGGTGTTGG
TCCATCTTTT ACGATAACTT GGCTAGTTGC TGGTTCGATT TCGGCAATGA ATTGGACAAC
TACCACGCCC CTTATCGCGC CTATCGTGCC GAGGATGGCG ATCTCGATTT TTATATGACA
TGGGCGCCTG AGTTGCTCGA CTTGGTCAAA CAACAAGAGC GATTGACCGG CGGCACCGCC
TTTCCGCCGC GTTGGAGCCT TGGGTACTCT GGGTCAACAA TGTCCTACAC AGATGCACCG
GACGCACAGG CGCAGCTGGA AGGTTTTCTA ACTAAGATTG CAGAGTATCG TATCCCTTGT
GACAGTTTTC AGATGTCGTC GGGCTACACC TCGATAGGGC CGAAGCGGTA TGTGTTCAAC
TGGAATGATG AGAAGGTGCC CGACCCTGCG GCTATGGCGG CAAAATTTGC CGATAAGGGG
GTGCATCTCA TTGCCAATAT CAAACCTTGC CTGCTGCAGG ACCATCCGCG TTACGCCGAA
GTCGCCAACG CAGGGTTGTT CGTGCAGGCG AGCGCAGAAG CGGACACGAA CTGCGGACCA
GAACGCTCGG TCTTTTGGGA CGACGAAGGC TCGCATCTCG ACTTTACTAA CCAAGCGACT
GTGGATTGGT GGAAGCAAAA TATCGGCGAG GCGCTGTTGC AGCGCGGTAT AGGCTCAACC
TGGAACGATA ACAACGAATA CGAAATCTGG GACCGACACG CTCAATGCGC GGGTTTCGGT
AAGGCAATCG ACATGTCGCT AATGCGTCCT GTAATGCCCA TACTTATGAC GCGCGCATCA
ATGGAGGCGC AGGAGGCCCA TGCACCGGAG AAGCGGCCCT ACCTAATCAG TCGATCAGGC
GCCCCGGGAT TGCAGCGTTA TGCCCAGACT TGGAGCGGAG ACAATCGCAC AGATTGGAAG
ACGCTGCGCT GGAACCAGCG GATGGGCCTC GGCATGAGCA TGTCTGGCTT TTATAACATT
GGACATGACA TCGGTGGCTT CTCTGGGCCG CGGCCAGAAC CGGAGCTGTT TGTGCGCTGG
GTCCAAAATG GTGTGTTCCA TCCTCGTTTT ACCATTCACT CATGGAATGA TGATGCGACA
GTAAACGAGC CTTGGATGTA TCCTGAGGTG ACGGACCACA TCCGCGCGGC GATAGAATTG
CGGTACCAAC TTCTGCCTTA TCTCTACACC TGCCTGTGGC AGGCGGCCGA GCGCAGTGAG
CCGATGCTAC GACCTTTGTT CCTTGATTTT GGTGCTGACC CTCAAGCTTG GGAGGAGAGC
GATAGCTTTC TTCTCGGGCG GGATCTCTTA GTGGCGACAG TCTTGGACAA GGGTGTTGAT
GCGATCTCGG TCTATTTACC GCGGCATTCC GGGGGCTGGT GGGATTTCCA CACTGGTCTT
TGGCACGAAG GCGGACAATG GCTCACCGTG CCTGTCGCTC TTGATACCAT TCCGCTGTTT
ATCCGCGGGG GCGCCGTGGT TCCGATGGGC CAAGGGGCAG ACCGCGCAGC ACCCGAGATG
GAGATCGCGC GACTTTTGGC TGTGTTCCCC GCGCAAGGCG TACAGGAAAC GACCAGCCTC
CTCTATGAGG ATGACGGTGT GACAAAGACG GGTAAGTGGT GCCTGTCTCA TCTCGTCTTG
AACAGCACCA ATGACCACAT CAGCCTGATC TCGAAACGTG AGGGTAATGG CGCGCCGGTT
TTACCAGAGG CACTGGTGGT CCTGCCGGCT GGTGAACGAC GCATTCTGAA GACGGGAACG
AGGATCGACT TATGA
 
Protein sequence
MKCLKIWALE AETTTGVILR VEGRHLLHIS VLEENRFRVS LQKDAEWRLE RTWTVAPAGD 
APWEGRRRED ISGFSCPPPQ LSQNETELVL STETMRLRIL APLQMVWEAK VNGAWKAFAE
DRPTGGIHLG LRDHAHAHFL SRHPQEPVFG LGEKTGPLNR VGQRYEMRNL DAMGYDAERT
DPLYKHVPFT LTRTQTAGCW SIFYDNLASC WFDFGNELDN YHAPYRAYRA EDGDLDFYMT
WAPELLDLVK QQERLTGGTA FPPRWSLGYS GSTMSYTDAP DAQAQLEGFL TKIAEYRIPC
DSFQMSSGYT SIGPKRYVFN WNDEKVPDPA AMAAKFADKG VHLIANIKPC LLQDHPRYAE
VANAGLFVQA SAEADTNCGP ERSVFWDDEG SHLDFTNQAT VDWWKQNIGE ALLQRGIGST
WNDNNEYEIW DRHAQCAGFG KAIDMSLMRP VMPILMTRAS MEAQEAHAPE KRPYLISRSG
APGLQRYAQT WSGDNRTDWK TLRWNQRMGL GMSMSGFYNI GHDIGGFSGP RPEPELFVRW
VQNGVFHPRF TIHSWNDDAT VNEPWMYPEV TDHIRAAIEL RYQLLPYLYT CLWQAAERSE
PMLRPLFLDF GADPQAWEES DSFLLGRDLL VATVLDKGVD AISVYLPRHS GGWWDFHTGL
WHEGGQWLTV PVALDTIPLF IRGGAVVPMG QGADRAAPEM EIARLLAVFP AQGVQETTSL
LYEDDGVTKT GKWCLSHLVL NSTNDHISLI SKREGNGAPV LPEALVVLPA GERRILKTGT
RIDL
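
The sequences above are displayed in space-separated groups of 10 characters, so the grouping has to be removed before any downstream use. The following is a minimal sketch of that cleanup together with a GC-percentage calculation of the kind summarized in the GC content field. The helper names are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAAGTGTC TTAAAATCTG GGCTTTGGAA GCGGAAACCA CCACAGGTGT GATTTTGCGC"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(seq.startswith("ATG"))  # True
print(round(gc_percent(seq), 1))
```

Applying `clean_sequence` to the full gene sequence block and taking `len` of the result gives a value that can be compared with the Gene Length field.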