Gene Oter_3221 details

Gene Information

Locus tag: Oter_3221
Symbol:
ID: 6204389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Opitutus terrae PB90-1
Kingdom: Bacteria
Replicon accession: NC_010571
Strand:
Start bp: 4204588
End bp: 4207575
Gene Length: 2988 bp
Protein Length: 995 aa
Translation table: 11
GC content: 66%
IMG OID: 641692888
Product: glycoside hydrolase family protein
Protein accession: YP_001820101
Protein GI: 182415035
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
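
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 4204588
end_bp = 4207575
gene_length_bp = 2988
protein_length_aa = 995


def span_length(start, end):
    """Length of a feature, assuming inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both flags are True when the table entries agree with each other.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, both checks hold for the values listed for Oter_3221.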


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.179721
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCTG AGCCGGCGTT CACCCCGCTC GCCGGGGAAG CCCGCCGGCA CCGACGCCTC 
GCTGTGACCT CCACATCCCG CACGAAATTC GTACTGCTCA TCGGGTGCCT GTTGCGCCTC
GGCACCGCGC TGCACGCCGC CGCCGGCGTT GACGCAGCCG GCCAGCTGGT GCCGGTCGTA
TCCCAACACG AAAACGGCTT CGCTTTCGCG CGCCTCGACC GCGGCGTCCA GTTCCGCGTC
GGCGACGTCA CCACCAACGT CCTGCTCTAC GGCCCGTCGA TCGTCCGCGT GAATGCGAAC
CTCGGCCAGG CGCACACCAC GCAGCCCAGC CTCGCGGTGG TCGCCCAACC CGCCACGGTA
ACCTTCACCG TGGAGGACTC GCCGGAGGCG CTCACGATCC GGACGGCAAA ACTGAGCGTC
ATCGTCGCGA AAAAATCCGG CGCGCTGACC TTCCTCGGCT CCGACGGCCG GCTGCTCACC
CGCGAACGCG CCGCCGCTCC GGACGAGATC AAGGAGGTCA CGATCTCTGG CGCGCCCACG
TACGAAATCA GTCGCACGTT CACGCTCGCG CCGGACGAAT CGCTCTACGG TCTCGGCCAG
TACAACCGGC CGTACATGGA TTATCGCGGC CAGGAAGTCC TCCTCGTTCA GACAAACATC
GGGATCGTCG TTCCGTTCCT GATCTCCACC CAACGCTATG GGGTCCTCTG GGACATCTAC
TCGAAGATGA CCTTCAAGGA CGACGTCTCC GGTGCCACGC TCTGGGCGGA AAGCGCGCCG
GCGGGCGTCG ACTACTATTT CATCGCCGGC GACACGATGG ACGGCGTGAT CGCCGGTTAC
CGCACGCTCA CCGGCGCCGC GCCGATGCTT CCCAAGCAGG CGTTCGGCCT GTTCATGAGC
AAGGAGCGCT ATCCCACGCA GGAGCGGCTG CTCGAGGTGG CGCGCACCTT CCGCCAGGAA
GGGTTCCCCC TCGATTACAT CGTGCAGGAT TGGCAATACT GGGGCGGCGC CGATGGCACG
TGGAGCGGCA TGACGTGGAA CCAGGAGCGG TTCCCGGATC CCGCGGCGCT CACGAAAACG
CTGCACGAGG AGCTGCGCCT GAAGCTCATG GTCTCCATCT GGCCGTCGGT CGGCAACAAC
ACGGCCCTGG CGCGCGAGCT GGACGCGAAG GGGCTGCGCT TTGCCCCGCT GCACTGGATT
TCGAAGAACG CGCGCGTCTA CGACGCGTTC GGCCCCGAGG GCCGTGCGAT CTACTTCAAG
CACATCAAGC AAGGACTGCT CGACGTCGGC GTCGACGCAC TCTGGATGGA CGGCACCGAA
GTCGAGGTCG GCACCGCCTG CCACGATCCC GCCGCCGTCG AGCGTGACAT CAAGAACCTC
GGCCGCAACG CGCTTGGCGA CTTCACGCGC TACCTGAATC CCTACAGCCT CGAAACCACC
CGCGGAACTT ACGAGGGCCA GCGCGGAACG AGCGACCAGC GCGTCTTCAC GCTCACCCGC
TCCGCCTGGG CCGGCCAGCA GCGTTACGCC GCGCTGCCCT GGTCGGGCGA CACGACCGCG
AGTTGGGAAA CGCTCCGCCA TCAGATCGCC GGCGGCATCA ACATCGCCTT CGCCGGCCTG
CCTTACTGGA CGCAGGACAC CGGCGGCTTC TTCGTCAACT ATCCGGACGG CGAGCGAAAC
CCGGAATATC AAGAGCTCTA CGCCCGCTGG AACCAGTTCG CGATCTTCAA CCCGATCTAT
CGGATCCACG GGACCAATAT CGAGCGCGAG CCGTATCGCT TCAAAGCCTT CGCGCCCGCG
ATCTACGACT CACTTCTCTC CGCGGTCCAG CTCCGCTACG CCCTGCTGCC CTACCTCTAC
TCGCTCGCGT GGCAGACGAC GGCGCACGAC TACACGATGA TGCGCGGGCT GCCGATGGAT
TTCCCGGACG ATCCGGCGGT GCGGAAAACC GACGATGCCT TCATGTTCGG TCCCGCGTTT
CTCGTGCATC CGATCACGCA CGCGATGTAT CACGTCAGCG CGCCGCCCGC CGCCACGATT
CCGGCCGAGG CGCTCCGCAC GCCCGACGAC CAGCCGGGGC TCGCCGTGCA GTATTTCGCC
GGCGTTGATT TTGGCCGCGC CGTCAGCACG AGCGTCGATG AAAAGGTCGA GCACGCCTGG
CCCGGTCCGC CGCTTGCCAA TCCGCCGTCC GGTCTCGACG ACTTCGACAA CTTCTCCGCG
CGCTGGACCG GCACCGTGAC CGCGCCCGAG GCTGGCGACT ACGAATTCGG CGTCGAATAC
GACGACGGCG CGCGCCTTTA CCTCGACGGC AAACTGCTCG TCGATGACTG GAGCTACGGC
GCCAAGCGCT ACCGCAGCGC TCGCCTCACC CTCGCGGCCG GTCAGCAGGT CGCCGTGAAG
GCGGAGTTTC ACCAGGGCGG ACAGGAGCGC TACTTCCGGC TCGGCTGGCG CACGCCGAGC
GAGAGTCGCG CCCTCGCCGC AGCCAGGAAG GAACTCGACA ACACAATGTC GACCTATCTG
CCCGCTGGCG CAGCGTGGTA TGACTTCTGG ACCAACGAGC GCTTCGCGGG TGGCGCCACG
GTCACGAAGG CCTGTCCGCT CGACACGTTC CCGGTTTACG TTCGCGCCGG CTCGATCGTG
CCGATGGGCC CGGCCGATCT GCAATATGCG ACCGAGCGGC CTGACGCGCC CTACACGATC
CGCATTTATC CCGGCGCGAA CGCGACCTTC ACGCTCTACG AGGACGACAA CGAAACCTAT
GCCTACGAGC GCGGTGAGCG CGCGACCTAC GAGCTCACCT GGGACGATGC CGCCCAGACG
CTCCATCTCG GCGGTCGCCA GGGTTCGTTC CCCGGTCTGG TCGCGCAACG CCAACTCGAG
CTCGTTCTCA TCGGAGCGAA AACCCCGTCT GCTCCGACGA TCATCACCTA CACCGGCCAC
CCGATGGCCG TTAGCCTCGC CTTCAATGAA AGCCTCCTCG CTCAATAA
 
Protein sequence
MTPEPAFTPL AGEARRHRRL AVTSTSRTKF VLLIGCLLRL GTALHAAAGV DAAGQLVPVV 
SQHENGFAFA RLDRGVQFRV GDVTTNVLLY GPSIVRVNAN LGQAHTTQPS LAVVAQPATV
TFTVEDSPEA LTIRTAKLSV IVAKKSGALT FLGSDGRLLT RERAAAPDEI KEVTISGAPT
YEISRTFTLA PDESLYGLGQ YNRPYMDYRG QEVLLVQTNI GIVVPFLIST QRYGVLWDIY
SKMTFKDDVS GATLWAESAP AGVDYYFIAG DTMDGVIAGY RTLTGAAPML PKQAFGLFMS
KERYPTQERL LEVARTFRQE GFPLDYIVQD WQYWGGADGT WSGMTWNQER FPDPAALTKT
LHEELRLKLM VSIWPSVGNN TALARELDAK GLRFAPLHWI SKNARVYDAF GPEGRAIYFK
HIKQGLLDVG VDALWMDGTE VEVGTACHDP AAVERDIKNL GRNALGDFTR YLNPYSLETT
RGTYEGQRGT SDQRVFTLTR SAWAGQQRYA ALPWSGDTTA SWETLRHQIA GGINIAFAGL
PYWTQDTGGF FVNYPDGERN PEYQELYARW NQFAIFNPIY RIHGTNIERE PYRFKAFAPA
IYDSLLSAVQ LRYALLPYLY SLAWQTTAHD YTMMRGLPMD FPDDPAVRKT DDAFMFGPAF
LVHPITHAMY HVSAPPAATI PAEALRTPDD QPGLAVQYFA GVDFGRAVST SVDEKVEHAW
PGPPLANPPS GLDDFDNFSA RWTGTVTAPE AGDYEFGVEY DDGARLYLDG KLLVDDWSYG
AKRYRSARLT LAAGQQVAVK AEFHQGGQER YFRLGWRTPS ESRALAAARK ELDNTMSTYL
PAGAAWYDFW TNERFAGGAT VTKACPLDTF PVYVRAGSIV PMGPADLQYA TERPDAPYTI
RIYPGANATF TLYEDDNETY AYERGERATY ELTWDDAAQT LHLGGRQGSF PGLVAQRQLE
LVLIGAKTPS APTIITYTGH PMAVSLAFNE SLLAQ
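
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch that strips the 10-base spacing, computes the GC fraction, and translates codons until the first stop. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. Table 11's alternative start codons are not modelled, which does not matter here because the gene starts with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna):
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(dna.split()).upper()


def gc_content(dna):
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 48 bases of the gene sequence listed above.
first_block = translate("ATGACCCCTG AGCCGGCGTT CACCCCGCTC")
last_block = translate("CCGATGGCCG TTAGCCTCGC CTTCAATGAA AGCCTCCTCG CTCAATAA")
```

Here `first_block` evaluates to `MTPEPAFTPL` and `last_block` to `PMAVSLAFNESLLAQ`, matching the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` would allow a comparison with the reported GC content.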