Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2046 |
Symbol | |
ID | 6975473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2267361 |
End bp | 2269562 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643391576 |
Product | Alpha,alpha-trehalase |
Protein accession | YP_002276421 |
Protein GI | 209544192 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCCCG TTCATGTGAC TTCAGCCCGT CTTTCTCTCC GCGCAGGATC GTGCCTCGCG GCCGCTCTGG GGGCGTCGCT GGCGCTGTCC GCTCCGCTGG CCGCCTTCGC CGATACGCCG CCTTCCGCGC CGGATTCCCT GGCCGCCGCC ATGGCGGCGA TGCAGCCGCC GCCGGCCATG CCGGCGGACG GGGCGGCCGA CCTGTCGGCC GCCGACCTGC CGCCGGTGGT CCTCTTCCTT CCGCTGTCCC CGGTGCCCGC CGGCCCCACG GTCGCGGTGC CCACCGCCGA CCAGCAGGAC CTGCGGCCGC CTTCGATCGC GCTGGCCGGC CTGTTCGCCG CCATGGGGGC GGCCCATGTC TTCGCCGATG CCAAGACGGC GGCCGACGCC ATCCCGGACG AGGCCTCGGA CGCGCTGCTG GCGGATTACG AACGGCAGAA GGTCCGTCCC GGATTCGACC TGAAGGATTT CGTGGCGCAG CATTTCGCGA TCGCCCCGCG CCGGACCGTT TCCTATCGCC GCCGCCCGAA CGAAAGCGTG CGTGACTATA TCAGCGGCAT GTGGGAGGTC CTCAGCCGTC CGCCCGACAC GCTGGTTGCG CATTCGTCGC TGCTGCCGCT GCCCGAAACC TATGTCGTCC CAGGCGGCCG GTTCAGCGAG CTGTATTACT GGGACACCTA TTTCACGATG ATCGGGCTGT ACGAGGACGG CCGGATCGAC CTGATGCGCG GCATGGTGCG CGACATCGCC TCGCTGATCG ACCGGTACGG GCACATGCCC AACGGCAGCC GCACCTATTA CCTCAGCCGG TCCGAACCGC CGTTCTTCGC GCTGATGATC GACCTGCTGG CGATGCATGA CGGGCAGGTC GCCTACACGA CGTTCCTGCC GGAACTGCAG CGGGAATACG ATTACTGGAT GGACGGGGCC GATAGCGTGG CCCCCGGCGC CGCCTGGCGG CATGTCGTGC GCCTGCCCGA CGGCACGCTG ATGAACCGCC ACTGGGACGA CATGGACACC CCCCGGGACG AAAGCTTCCC GCAGGATATC GCCACCGCCG CCCAGTCGTC GCGTCCGGCG GCCCAGACCT ACCGCGACCT GCGCGCCGGG GCGGAAACCG GGTGGGACTA TTCCTCGCGC TGGCTGGCCG ACGGGCACAG CATGGCCACG ATCCACACCA CCGACCTGCT GACGATCGAA CTCAACTGCC TGATCGCGCA CCTGGAACAG ACCCTGTCCC ACGCCTATGA CCTGCGGGGG AACAAGGCGC AGGCCGACCG GTACGCCACG CTGGCCACCG CGCGGATCGA TGCCATCCGC CGGGTCCTGT GGGACCCGAA GCGCGGGGCG TTCTTCGATT ACGACTGGAA GACGCGCACC CTGTCGCCCG TCCTGTCGGC GGCCACCGCC ATGCCGCTGT TCCTGCAGAT GGCGACACCC GAGCAGGCCC GCGCGGTGGC CGAGACCATG CGGACGAAGC TGCTGAAAGT CGGCGGCCTG ACCGCCACCG ACCATGTCAG CGGGCAGCAA TGGGATTCGC CGAACGGCTG GGCGCCGGAA CAATGGATGG CGATCAAGGG GCTGAACCAG TACGGCCTGG ACGACCTGGC GCAGCAGATC GCCTCGCGCT GGATGGAGCG CGTGATCGGC ACCTACGAGA AATCGGGCGT GTTGCTGGAA AAATACGACG TGGTGAACCC GTCCATCAGC CCCACCGGCG GCAAGGGCGG CGGCGAATAT CCGATGCAGG TCGGGTTCGG CTGGACCAAC GGCACGTTGC TGGGCCTGAT GAACCGCTAT CCGCAGGACA CCCGCGTGGT CCTGGACCGC AATCCACGCG CCGAACAGCC CTTCGCCCAG CCCCTGCCGC CGGTCGATGC CTACCGGGTC CAGGGGCCGA CGACGGCGCC GGCCCCGGTT CCGCTGCACC CCACGCCGCC GCCGGCCGCC ACGGTGCAGG ACAAGCCGGT CCCGTCCGGG GCGCCGGTTC CAGCCGTGTC GGGCCCGGCC CCCCCTGCGC CGGCGCCTGT GGCCGCCGGA TCAGACGCGG CCGACGCCGC GAAGCCGGTG CAGGGTCACG ATGGCGATGG TCAGAAGGAC GGCGGCCCAG GTCTGGTGCA CGGTTCCGGC CCAGGCGGGA ACGACCAGAA GCAGGGTGGT GATGCCCAGC GCGTACTGAA TCAGGACCGC CCAGCCCATT AG
|
Protein sequence | MTPVHVTSAR LSLRAGSCLA AALGASLALS APLAAFADTP PSAPDSLAAA MAAMQPPPAM PADGAADLSA ADLPPVVLFL PLSPVPAGPT VAVPTADQQD LRPPSIALAG LFAAMGAAHV FADAKTAADA IPDEASDALL ADYERQKVRP GFDLKDFVAQ HFAIAPRRTV SYRRRPNESV RDYISGMWEV LSRPPDTLVA HSSLLPLPET YVVPGGRFSE LYYWDTYFTM IGLYEDGRID LMRGMVRDIA SLIDRYGHMP NGSRTYYLSR SEPPFFALMI DLLAMHDGQV AYTTFLPELQ REYDYWMDGA DSVAPGAAWR HVVRLPDGTL MNRHWDDMDT PRDESFPQDI ATAAQSSRPA AQTYRDLRAG AETGWDYSSR WLADGHSMAT IHTTDLLTIE LNCLIAHLEQ TLSHAYDLRG NKAQADRYAT LATARIDAIR RVLWDPKRGA FFDYDWKTRT LSPVLSAATA MPLFLQMATP EQARAVAETM RTKLLKVGGL TATDHVSGQQ WDSPNGWAPE QWMAIKGLNQ YGLDDLAQQI ASRWMERVIG TYEKSGVLLE KYDVVNPSIS PTGGKGGGEY PMQVGFGWTN GTLLGLMNRY PQDTRVVLDR NPRAEQPFAQ PLPPVDAYRV QGPTTAPAPV PLHPTPPPAA TVQDKPVPSG APVPAVSGPA PPAPAPVAAG SDAADAAKPV QGHDGDGQKD GGPGLVHGSG PGGNDQKQGG DAQRVLNQDR PAH
|
| |