Gene Information |
Locus tag | Gdia_2091 |
Symbol | |
ID | 6975518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2320687 |
End bp | 2321934 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643391620 |
Product | hypothetical protein |
Protein accession | YP_002276465 |
Protein GI | 209544236 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
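The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gdia_2091 record above.
length = gene_length_bp(2320687, 2321934)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Both results agree with the Gene Length (1248 bp) and Protein Length (415 aa) rows.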
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.030072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCGA GTCTGTTCCG CGCCGGCCTT CTCGCCGGTG CGATCGTTGC ACTTCCCTTT TCGATGTCCG TCGCCGGGTC TGCCCAGGCC GCGGATACAT TGAGCATGAA GGTCGGCAAG CCGCTGCAGC AGGCGCAGAC CGCCCTGGCC GCGCACAAAT ATGCCGCGGC GATGAGCGCC GTCGACACCG CCGAGGCGGT GCCGGGCAAG ACCGACTACG AAGAATACAC CATCGCGCAG ATGCGCGCCG CCGTCGCGGC GCAGTCGGGT GACGTGGCGG CCGCCACCGC CGCCTATGAC AAGCTGATCG CCTCCAGCCG CACCCCCAAG GATGCCAAGC TGCAGATGAT GATGGCCGAG GCCACGATGG CCTACACCGC CAAGGACTAT CCGCGCGCGG TGACAGGGAT CGAGCGGTAC CTGCACGCGG CGGGCAGCAA TCCGGCGATG GAAACGCTGC TGATCCAGTC CTACTACCTG CAGCGGGACT ATGCGAACGC CGCCCGGGTG CAGCAGGCGC AGATCGACGC GGAAGTGAAG GGCGGCAAGG TCCCGACCGA ATCCCAGCTT CAGCTTCTGG CGGCCTGTCA GACGCAGCTC AAGGACCTGT CGGGCCTGAA CCGCACTTAT GTCACGCTGG CGTCCTATTA TCCCAAGCCG GAATACTGGG CGCTGATCCT GCATGGGCTG ATGGTCAACC CCAAGGTTCC GCCGGGGCTG CAACTCGACA TCTATCGCAT CCGTCAGGCG GCGGGCGTGC TGACCGCGCC GGCCGACTAC ATGGACATGA CCGAACTGGC CATGCAGGCC GGGTTGCCGC AACTGGCGCT GGACCTGATG AATGCCGGCT ATGCCAGCGG CGCCCTGGGC AAGGACGCGG GCGCCGCGCG CGAAGCCCGG CTGAAGGCCA TGGTCGTGAA GGCGGTTGCG GACAAGAAGG CTACGATCGC GGCGGACGAA GCCGCCGCCG TCAACGCCCC GACCGGGAAC GCCCTGCTGA CCGCAGGCTA CAACTATGTG ACCTTCGGCC AGGTGGACAA GGGACTGGCG GTGATGCGCC AGGGGCTGGC GAAGGGCCCG CTGGACCCGA ACATCGGCAA GCTGCATCTG GGCCTGGCCC AGGTGGCGGC CGGCCGTACC GCCGACGGGA TCGCGACGTT GAAGACCGTG GACGGCGACA ATGGCGCGCG CGACATCGCG CAATTGTGGA TCCTGAAACT GACCCCCGCT GCGGCGGCTC ATCACTGA
|
Protein sequence | MSSSLFRAGL LAGAIVALPF SMSVAGSAQA ADTLSMKVGK PLQQAQTALA AHKYAAAMSA VDTAEAVPGK TDYEEYTIAQ MRAAVAAQSG DVAAATAAYD KLIASSRTPK DAKLQMMMAE ATMAYTAKDY PRAVTGIERY LHAAGSNPAM ETLLIQSYYL QRDYANAARV QQAQIDAEVK GGKVPTESQL QLLAACQTQL KDLSGLNRTY VTLASYYPKP EYWALILHGL MVNPKVPPGL QLDIYRIRQA AGVLTAPADY MDMTELAMQA GLPQLALDLM NAGYASGALG KDAGAAREAR LKAMVVKAVA DKKATIAADE AAAVNAPTGN ALLTAGYNYV TFGQVDKGLA VMRQGLAKGP LDPNIGKLHL GLAQVAAGRT ADGIATLKTV DGDNGARDIA QLWILKLTPA AAAHH
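The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the pipeline that produced the record: it uses the codon-to-residue assignments that translation table 11 shares with the standard genetic code, ignores table 11's alternative start codons (this gene begins with ATG), and the names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is removed.
    Codons containing characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("ATGTCATCGA GTCTGTTCCG CGCCGGCCTT"))  # MSSSLFRAGL
```

The printed residues match the first ten residues of the protein sequence listed above.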