Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1287 |
Symbol | |
ID | 6974692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1432229 |
End bp | 1433515 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643390816 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002275684 |
Protein GI | 209543455 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCGC CGCTTGTCGT CATTTTTCTC CTCGTTCTCA TCAATGGCAT CTTCGCCATG GGGGAACTGG CCCTGATCTC GGCGCGCCGC ACGCGCCTGA TGATCATGCA CCGCAGCGGG GTGAAGGGCG CCGAGCGGGC CCTGCGCCTG GCCGAGGACC CGCAGAGCTT CCTGCCCACG GTGCAGGTCG GCATCACCCT GGTCTCGATC CTGGAAGGTA CGTTCGGCGG CACGCAGATC GAAGGGTACC TGACGCCCTG GCTGGCGCGG TTTCCGGCCT TGCGTCCGTT CGCCGCCGAA CTGTCGATGA CGGTCGTGGT GGTGGCGATC ACCTCGCTGA TGCTGGTGCT GGGCGAACTG GTGCCCAAGC AGCTTGCGCT GCGCCATCCC GAAATCGTCG CGGCGCGGCT GTCGCTGCCG TTGGAAGGCC TGGCCCGCGT CACGCGCCCG GCCGTCTGGC TGCTGGGGCG GTCCTCGAAC CTGGTGCTGC GCCTGATGGG CGTGGGCGCC ATGACCCGCG AAGCCCTGAC CGAGGAAGAA CTGAAGGCCT ATATCGCCGA AGGCGCGCAG TCCGGCGTGC TGGAGCAGGA GGAGCGCGAC ATGATCGAGC GCCTGCTGCG CCTGGCGGAC CGGCCGGTCC GTGCCATCAT GACGCCGCGC AACGAGCTGT TCTGGATCGA ACGCCACGCC AGCCGTGAGG AACTGCGCCG TACCCTGCGC AACACCGTCT ATACCCGCAT CGTGGTCTGC GACGGCGGGG TCGACAATCC GGTGGGCGTC ATCCTGGCGA AGGACATGCT GGACCGCCTG CTGGACGGGC GGCCGGTCAC GATCGAATCG GGGCTGCGCA AGCCCGTGGT GGTCCCCGAC ACGATCTCGG CCTTCGACAT GGTCGAACGG ATGCGCACCG TGCCCCTGGG CATCGCGCTG GTGCTGGACG AATACGGGTC GTTCGAAGGG ATCGTGACCG CCTCGGACCT GTTCGAGGCC ATCGTGGGCG AACACCACGA ACCGGGCAGC ACCCCCAAGA AGCGCATGGC GCAGGACGAC GTGCTGATCC TGGACGGCTT CATGCCCGCC GACGAGGTCA AGGACCGCCT GGGCCTGTCC GACCTGCCGG ACGAAGGCAG CTACCACACG CTGGGCGGCC TGATCCTGGC GCTGCTGCGC CGCGTGCCGG CGACGGGGGA CAAGGTCGTG TTTTCCGGCT GGCTGTTCGA GGTCCTGGAG ACCGACCAGC GGCGGGTTGT GAAGGTCCGG GCCAGCCGTC AGGCGCTGGC GGACTGA
|
Protein sequence | MIAPLVVIFL LVLINGIFAM GELALISARR TRLMIMHRSG VKGAERALRL AEDPQSFLPT VQVGITLVSI LEGTFGGTQI EGYLTPWLAR FPALRPFAAE LSMTVVVVAI TSLMLVLGEL VPKQLALRHP EIVAARLSLP LEGLARVTRP AVWLLGRSSN LVLRLMGVGA MTREALTEEE LKAYIAEGAQ SGVLEQEERD MIERLLRLAD RPVRAIMTPR NELFWIERHA SREELRRTLR NTVYTRIVVC DGGVDNPVGV ILAKDMLDRL LDGRPVTIES GLRKPVVVPD TISAFDMVER MRTVPLGIAL VLDEYGSFEG IVTASDLFEA IVGEHHEPGS TPKKRMAQDD VLILDGFMPA DEVKDRLGLS DLPDEGSYHT LGGLILALLR RVPATGDKVV FSGWLFEVLE TDQRRVVKVR ASRQALAD
|
| |