Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0848 |
Symbol | |
ID | 6974245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 964466 |
End bp | 965827 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643390377 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002275253 |
Protein GI | 209543024 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0413942 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.363265 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCATT CCACGACTTC GCGGCGCTGG TCCGGTGCCT CGATGCAGAT TCTCGGCATG CTGGTCCTGG GCACCGCATC CTGCCTCGGC ATGGGCGCGG CGTCGCCGGC CCATGCGGCC GGCACGCTGA CGATCGCCAC GGTCAACAAT GCCGACATGG TCGTCATGAA ACAGTTATCG GGCGAATTCG AGACGGCGCA TCCCGACATC CACCTGAACT GGGTGACGCT GGAGGAAAAC GTCCTGCGCC AGCGGGCGAC GACCGACATC GCGACCCATT CCGGCCAGTT CGACATCCTG ACGATCGGCA ATTACGAGGT GCCGATCTGG GCCAAGCAGG GATGGCTGGC AGAACTGCAC CCCGATGCCG CCTATGATGC GGACGACATC CTGCCGGCGG TACGTGCCGG CCTGACGGTG AACGACAAGC TGTATGCCCT GCCGTTCTAT GCCGAAAGCG TCATGACCTA TTACCGCAAG GACCTGTTCG CCAAGGCCGG GCTGACGATG CCCGACGCCC CGACCTATGA CCAGATCCGC ACCTTCGCCG ACAAGATCAC GGACAAGGAC AGCCAGACCT ATGGCCTGTG CCTGCGCGGC AAGCCGGGCT GGGGCGAGAA CATGGCCTAT GTCACGTCGC TGGTGAACAC GTTCGGCGGC CAGTGGTTCG ACATGACCTG GCACCCCATG CTGAACAGTC CGGAATGGAA GGCGGCGCTG ACCTGGTACG TGTCGGCCCT GAAGGCCGAT GGCCCGCCCG GGGCGACATC GAACGGATTC AATGAAAACC TGGCGCTGTT CGCCAGCGGC CATTGCGGCA TCTGGATCGA TTCCACGGTG GCCGGCGGCC TGCTGTTCGA TCCGGCGCAA TCACACGTGG CCAATACGGT AGGCTTCGCT TCCGTGCCGC TGGGGCCATA CGGCAAGGGA CCGACCTGGC TGTGGAGCTG GAACCTGGCC ATTCCCGCAT CGTCCACCCA TGTGGCCGAC GCCCAGACCT TCATCACCTG GGCGACGTCG AAGGCCTATG TGCAACTGGT GGCGAAGAAC CGCGGTTGGG TCGCAGTGCC CGCCGGCACG CGCCTGTCGA CCTACAACAC CCCGGAATAC CAGAAGGCAG CGCCGTTCGC GGCGTTCGTC CATAACGCCA TCGACCATGC CGATCCGAAC GGGCCGACGA AGCAGCCGCG CCCCTATGGC GGCGCGCAGT TCGTCGCCAT CCCGCAGTTC CAGGCCATCG GCACCCAGGT CGGCCAGAGC GTCGCCGCCG CCCTGTCGGG CCAGACGACG GTCGAGCAAA CCCAGGCCTC GGCCCAGGCG TTGGTGACAC GCACCATGCG GCAGGCAGGC CTGCTGCATT AG
|
Protein sequence | MAHSTTSRRW SGASMQILGM LVLGTASCLG MGAASPAHAA GTLTIATVNN ADMVVMKQLS GEFETAHPDI HLNWVTLEEN VLRQRATTDI ATHSGQFDIL TIGNYEVPIW AKQGWLAELH PDAAYDADDI LPAVRAGLTV NDKLYALPFY AESVMTYYRK DLFAKAGLTM PDAPTYDQIR TFADKITDKD SQTYGLCLRG KPGWGENMAY VTSLVNTFGG QWFDMTWHPM LNSPEWKAAL TWYVSALKAD GPPGATSNGF NENLALFASG HCGIWIDSTV AGGLLFDPAQ SHVANTVGFA SVPLGPYGKG PTWLWSWNLA IPASSTHVAD AQTFITWATS KAYVQLVAKN RGWVAVPAGT RLSTYNTPEY QKAAPFAAFV HNAIDHADPN GPTKQPRPYG GAQFVAIPQF QAIGTQVGQS VAAALSGQTT VEQTQASAQA LVTRTMRQAG LLH
|
| |