Gene Information |
Locus tag | Gdia_0022 |
Symbol | |
ID | 6973411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 26417 |
End bp | 27526 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643389555 |
Product | hypothetical protein |
Protein accession | YP_002274439 |
Protein GI | 209542210 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.205908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.027287 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACCATG CCCCGACCGC CGCTTCCAGC CGGGCCAAGA CGGGGGGGGA TGCGGCACGG CAGCCCGCGC CGCTGTCGAT GTTCGAATTC TGGCCGGGCT GGGCCATCTA TACCCCGGTC GTGCTGTACT GGATCCTGCT GGGCCTGTGG CACCGGGATT TCAGCCTGCC GACCGCGGCC AATCCCCGCA TCCTGACCGG CGGCCTGTGC GGCGAAAGCA AGACCAGCAT CCTGGACATG GCGGGCGAGA CGGCGCGGCG CTGGATCGCG CCCTATGTCT CGGTCACCAC CGGTTCGGCC GATGACGGCG CGGCCGCCCT GGCGGCGCTG GATCGCGGCG GGCTGGCGCT GCCGGTGGTG GTGAAGCCCG ATATCGGCTG CAACGGCGCC GGGGTGAAGC TGGTCACGAC CCCGGACGAA CTGGTCGCGG CGGTGGCGCT GTACCCGCCC GATACCCCCC TGGTCATGCA GCGGCTGATC CCGTTCGAGC ACGAGGCCGG CGTGTTCTAT ATCCGCCACC CAGACGAGGA CCGGGGCCGG ATATCCTCGC TGACCTACAA GGAGGCACCG GTCATCGTCG GCGACGGCCG GTCCACGGTG CGACAACTGA TCGATGCCGA CGCGCGCACG CGCCTGGTGC CGCATCTGTA TCTGCCCCGC CTGGGCGATC GGGTGCATGA GGTCCTGCCG GCGGGCATGC CGCTGCGGCT GGTCTTCGCC GGGAATCACA GCAAGGGGTC GATCTTCCGC AACGGCGCGG ACGACATCAC CCCGGCCCTG GTCGAGCAGA TCGACCGGAT CATGCAGGAT ATCCCCGATT TCCATTTCGG CCGGATCGAC CTGAAGTTCG AATCCATCGC CGCCCTGCGC CTGGGCCGGG GGTTCGAAAT CATCGAGATC AACGGCGTGG GGTCCGAAGC GACCCATATC TGGGATTCGC GCACCACCCT GCGCGAGGCC TATGCGGCGC AGTTCACGCA TTACCGCGAG ACCTTCCGCA TCGGCGCCAA GAAGAAGAAG GCCGGATGGC GGACCAGCGG CGCCTTCACC ATGCTGCATT ACTGGCGCCA GCAGAGGCGG CTGCTCGCCT CCTACCCCCT GAACGACTAG
Protein sequence | MNHAPTAASS RAKTGGDAAR QPAPLSMFEF WPGWAIYTPV VLYWILLGLW HRDFSLPTAA NPRILTGGLC GESKTSILDM AGETARRWIA PYVSVTTGSA DDGAAALAAL DRGGLALPVV VKPDIGCNGA GVKLVTTPDE LVAAVALYPP DTPLVMQRLI PFEHEAGVFY IRHPDEDRGR ISSLTYKEAP VIVGDGRSTV RQLIDADART RLVPHLYLPR LGDRVHEVLP AGMPLRLVFA GNHSKGSIFR NGADDITPAL VEQIDRIMQD IPDFHFGRID LKFESIAALR LGRGFEIIEI NGVGSEATHI WDSRTTLREA YAAQFTHYRE TFRIGAKKKK AGWRTSGAFT MLHYWRQQRR LLASYPLND
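The coordinate and length fields in this record can be cross-checked against one another. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, not stated in the record) and that the final codon of the CDS is a stop codon, as the trailing TAG in the gene sequence suggests. The helper names `span_length` and `expected_protein_length` are hypothetical and used only for illustration.

```python
# Values copied from the record above.
start_bp, end_bp = 26417, 27526
gene_length_bp = 1110
protein_length_aa = 369


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both checks compare a derived value with the value listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks pass: the coordinate span is 1110 bp, and 1110 / 3 = 370 codons, one of which is the stop codon, leaving 369 residues.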