Gene Information |
Locus tag | Gdia_0071 |
Symbol | |
ID | 6973460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 80561 |
End bp | 82462 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643389604 |
Product | surface antigen (D15) |
Protein accession | YP_002274488 |
Protein GI | 209542259 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0729] Outer membrane protein |
TIGRFAM ID | |
| |
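The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive positions and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 80561
end_bp = 82462

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1902 bp) and Protein Length (633 aa) fields.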
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.309783 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.034078 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGCTGG TCGTCTTCGT GGGCTGGATG CCGCCGCCCG TTCGCGGCGC GGACCCGCAA TCCTACGTCA CCGTGATCCG CCCGACCGGC CAGGGCGACC TGGACGCGGC GATCAGCGCG TCCTCCAGCC TGCTGTCCCT GCAGAAGACC AAGGCCGTCA GCCCGTTCGC GCTCGCCGGC CGCATCCGCA ACGATTACGA CCGCCTGCGC ACCGCCCTGG AAAGCTACGG CTACTACGCC GCGACGATCC GCATCGCCGT CGGACTGCGC GCCGGGGGGC ACGCCGCGCC GTCGGCGCCG CCGGCGACGA TGGATGGCCA GGACCCGCGC CTGCCGGAAT GGCTGCTGGC GGTTCCCAAG GGGCAGGCCG TGCAGGTCAC CATCACGCCG GTCCGGGGCG ACATCTTCCA TCTGGGGCAG GTGACGCTGA AGCCGTCGCC GGAGGACGGC ACGGCCCCGA TCGTCCTCAA CGCCCCGGAA CGCACCGCCC TGGGCGTGGC CTCGGGCCAT CCGGCCATCG CGTCCGACGT GCTGGCGGGC GGGGTGAACC TGCAGGCGGA ACTGAAGGAG GAAGGCCACG CCCTGGCGCA GGTCGGCACG CCCAAGGCCT GGCTGCGGCC CCAGACCCAT ACGCTGGACG TCGAATACAC CGTGCGGCGC GGGCCGATCG TGACGATCGG CGCCATCGCG CTGTCCGGGC TGAAGCGGAC CCATCCCGCC TATATCGCGC GGCGGATCAC CCTGCACCCC GACCAGCTTT ACCAGCCGTC ACACATCGAG GCGGCGCGGC AGGACCTGGC GTCGCTCGGC GTGTTTTCCG ACGTGCAGGC CAGCGACGCG CCGCCGCTGA CGGCTGGCCG GCAAATGCCG CTGAACTTCG CCTTCACCGA GGGCAAGCAG CGGATGGCGG AGGTGGAGGG CGGATATTCC ACCGACCTGG GCGGCCGGGG CGGCGTAAGC TGGACGCACA ACAACATCTT CGGCAATGCC GAGCGCCTGC GCCTGACCAC CCTGGTGACG GGACTGGGCG GTTCGGCGCA GCAGGGGCTG GGCTATGACG TATATGCCGA CCTGCTGAAG CCGGATTTCG GTGACCGCGA CCAGAACCTG AGCGTGCGGG TCGAGGGAAT CCGCCAGTTG CTCTATTCCT ACCGGCAGAC GGCGCTGCTG GTCCGCGCGG GCATCGTCCG CCATCTGGGG CGGCGATGGA CGGTGTCTTT CGGCGGCGAG GCCGAACAGG AACATATCGA ACAGATGGGG ATGTCCAACG ACTACACCAT CGTGTCCCTG CCCCTGTCCG CGACCTATGA CAGCACGGGG CTGACCAACC CGATCGACCC CGCGACCCAC GGGGTGCGCA TCGCCGCCAG CGCGACGCCC TCGGCCTCCC TGATCAGCGG CACGTCGTTC TTCACCATCC TGCAGGCGAC GGCATCCACC TATTTCGACC TGTCGCATGT GGGCCTTTCG CGGCCCGGGC GCAGCGTCTT CGCGTTTCGC GGCGTCGTCG GCAGCGTGCA GGGGGCTTCG ACGTTCGAGA TTCCGCCTGA TCAACGCCTG TATGCCGGCG GCAGCGCGAC CGTGCGCGGC TTCCGCTACC AGGGCGTGGG GCCGCAATTT CCCAACAGCA AATACGCGAT CGGCGGCACG TCGATGGATG CGGGCACCGT GGAATTCCGC CAGCGCCTGT TCCGCAGCTT CGGCGCGGCG CTGTTCGCCG ATGCCGGCCA GGTCGACACC GGCAGCAGCC CCCTGCATGG CACGCTGCGC GTCGGCGCAG GGGCAGGGGT GCGGTACTAT 
ACGCCGATCG GCCCGGTGCG GGTGGACGTC GCGTTCCCGC TGAACCGGCC GGCGCAAGGC GATACGTGGG AACTCTATAT CGGCCTGGGG GAAACCTTCT GA
|
Protein sequence | MGLVVFVGWM PPPVRGADPQ SYVTVIRPTG QGDLDAAISA SSSLLSLQKT KAVSPFALAG RIRNDYDRLR TALESYGYYA ATIRIAVGLR AGGHAAPSAP PATMDGQDPR LPEWLLAVPK GQAVQVTITP VRGDIFHLGQ VTLKPSPEDG TAPIVLNAPE RTALGVASGH PAIASDVLAG GVNLQAELKE EGHALAQVGT PKAWLRPQTH TLDVEYTVRR GPIVTIGAIA LSGLKRTHPA YIARRITLHP DQLYQPSHIE AARQDLASLG VFSDVQASDA PPLTAGRQMP LNFAFTEGKQ RMAEVEGGYS TDLGGRGGVS WTHNNIFGNA ERLRLTTLVT GLGGSAQQGL GYDVYADLLK PDFGDRDQNL SVRVEGIRQL LYSYRQTALL VRAGIVRHLG RRWTVSFGGE AEQEHIEQMG MSNDYTIVSL PLSATYDSTG LTNPIDPATH GVRIAASATP SASLISGTSF FTILQATAST YFDLSHVGLS RPGRSVFAFR GVVGSVQGAS TFEIPPDQRL YAGGSATVRG FRYQGVGPQF PNSKYAIGGT SMDAGTVEFR QRLFRSFGAA LFADAGQVDT GSSPLHGTLR VGAGAGVRYY TPIGPVRVDV AFPLNRPAQG DTWELYIGLG ETF
|
| |
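The gene and protein sequences can be compared with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the standard bacterial code (translation table 11) and treats an alternative start codon such as GTG as methionine when it opens the reading frame. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Only the first three 10-base blocks of the gene sequence are used as input here, so the GC fraction printed is for that fragment and not the 70% reported for the whole gene.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); tables 1 and 11 share these amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as displayed in the record.
first_blocks = "GTGGGGCTGG TCGTCTTCGT GGGCTGGATG"
print(translate(first_blocks))
print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after applying `clean` to both) would extend the same check to the whole record.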