Gene Information |
Locus tag | Gdia_0844 |
Symbol | |
ID | 6974241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 959836 |
End bp | 960846 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643390373 |
Product | hypothetical protein |
Protein accession | YP_002275249 |
Protein GI | 209543020 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.655414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0599894 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACCG CCGCCCTTGG CGCCTCGATG GCGGCGCTGG GCGGCATCCG GCGCGCCGCC GCTAAAACCG TCAGCCATCC CCGATTCCGT CTGGTTTTCG TCAACCATGT CACGACCAAT CCGTTTTTCA CGGCGACACA GTATGGACTG CGCGACGCGG CGGCCCTGGT CGGGGCGGAT ACCCAGTGGA CGGGGTCGGA AAACAGCATC GCGGCCGAGA TGATCACCGC GATCAATGCC GCCATCGCCG CCAAGGCCAG CGCGATCGCC GTCTGCCTGG TCGATCCGCA TGCCTTCAAC GATCCGGTCG AACGCGCGCT GGCCGCCGGA ATCCCGGTCT TCGCCTATAA CGCGGACGCG CCGGCGGGGT CGGGCAACAA GCGCCTGGCC TATATCGGGC AGGATCTGTT CAAGGCCGGG CAAATGATGG GCCAGCGGAT TCTCGACCTG GTGCCCGGCG GGCGCGTGGC GCTGATGATC GCGACACCCG GGCAGTTGAA CATCCAGCCG CGTATCGACG GCGCGCAGGA CATGCTGCGC AAGAGCGGCC GGTCGTACCA GATCGACATC GTCGCGACGG GCGCCACGGT GAACGAGGAA CTGTCCAAGG TGAAGGCCTA TTACCTGGGC CATTCGGACG TGAAGGGAAT GTTCGCCGTC GATGGCGGCA CCACGCAGTC CGTCGCCGAC ACGATGGCGC AGTACGGCCT GGCCGCCAAG GGCGTGCGGG GCGGCGGCTT CGACCTGCTG CCGCGCACGC TGCGCCTGAT CAATGACGGG CACCTGGATT TCACCATCGA CCAGCAACCC TATCTGCAGG GCTACTACAC GGTCATGGAA ATGTACACCT ACCTGATGTC CGGGGGCCTG GTGGGACCGG CGGAAATCAA TACCGGCCTG AAATTCGTGA CCAAAGGGGA TGTGGCGCCA TACCTGGCGA CCAAGAGCCG CTACGAGGGC AGTTCGAGCG AGGCGCAGTT CATCCCGCGC TCCGGCCCGA TCCAGTCCTA G
|
Protein sequence | MQTAALGASM AALGGIRRAA AKTVSHPRFR LVFVNHVTTN PFFTATQYGL RDAAALVGAD TQWTGSENSI AAEMITAINA AIAAKASAIA VCLVDPHAFN DPVERALAAG IPVFAYNADA PAGSGNKRLA YIGQDLFKAG QMMGQRILDL VPGGRVALMI ATPGQLNIQP RIDGAQDMLR KSGRSYQIDI VATGATVNEE LSKVKAYYLG HSDVKGMFAV DGGTTQSVAD TMAQYGLAAK GVRGGGFDLL PRTLRLINDG HLDFTIDQQP YLQGYYTVME MYTYLMSGGL VGPAEINTGL KFVTKGDVAP YLATKSRYEG SSSEAQFIPR SGPIQS
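
The length fields in this record are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the source database: the helper names `gene_length`, `protein_length`, and `gc_percent` are hypothetical, and the code assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon (the gene sequence above ends in TAG).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(959836, 960846)
print(length)                  # 1011, matching the Gene Length row
print(protein_length(length))  # 336, matching the Protein Length row

# First two 10-base blocks of the gene sequence only, not the full gene.
print(gc_percent("ATGCAGACCG CCGCCCTTGG"))  # 70.0
```

Passing the full gene sequence to `gc_percent`, spaces included, gives a value to compare against the GC content row.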