Gene Information |
Locus tag | Gdia_2512 |
Symbol | |
ID | 6975942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2763642 |
End bp | 2765459 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643392030 |
Product | protein of unknown function DUF885 |
Protein accession | YP_002276871 |
Protein GI | 209544642 |
COG category | [S] Function unknown |
COG ID | [COG4805] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
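The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and that the final codon is a stop codon not counted in the protein length. The function name `cds_lengths` is illustrative only.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp copied from the Gdia_2512 record.
gene_len, protein_len = cds_lengths(2763642, 2765459)
print(gene_len, protein_len)  # 1818 605
```

Under those assumptions the result matches the listed Gene Length (1818 bp) and Protein Length (605 aa).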
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.141981 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCAACAC GTTCCCCCTT CGGCACGATC CTGCGGTCTG TCAACCGAAA GGGCACGTTC TTGAAAAGAC TCCTCCTGGC CTCGATCGCC CTCTTTCCCG GCCTGGCGCA CGCGGCCGAA GACCGGAACG CGTCCTTTCA GGCCCTGCTG GCGCAGCAAT GGGAATGGCA GTTGCGCGAG AGCCCGGAAC TGGCGACGAC CATCGGGGAC GACCGCTATA ATGATCGCTG GTCCGACATG TCGCTGGCCC ATGTCGCGGT TCAGGACCGC GCGCTGAAGG ACTTCCAGAC GCGGTTCACG GCCTTCGGAA CGGCGGGATT GTCGGAACAG AACCGCCTCA GCCGCCAGAT GATGCTCCAG CAACTGGCGA TCGACCGGGA GGCGATCCGG CTGAAGATTT ATGAAATGCC GATCGACCAG ATCAATGGGG TGCCGTTGCA ACTGGCTGGT TTCGTGTCCA GCATTCCGTT CGATTCCGTC AGACACTATC AGGATTACAT CGCCCGCCTG CACGCCATTC CGGCGGTCCT GCAGGCGGTG GTCGACAGCG CGCGGGCCGG CATGGCGGAC CATCTGATGC CGCCGCGCTT CCTGCTGGAA AAGGCGATTC CGCAATTGCG CGGCATCGCG TCCCCGGCCG GGACGGACAA TGTGTTCGGA CAGCCGGCCC TGCATTTTCC GGCGACGATT CCGGCGGACC AGCAGGCGGC GCTGCGGGCG CAGATCATCG CCGCCGTCGA TCGCGAGGTC CGCCCGGCGT TCGAGAAGAC GGCCGATTTC GTCGAAAAGG ACTACGCGCC GCACGGCCGG GCGCAGGACG GGATCTGGGC GCTGCCCGAC GGGACCGCGC GCTATCGCTT CGCCATCCGC CAGCAGACCA CCAGCACCAT GACCCCCGCC GCGATCCACG CCCTGGGCCT GGCCGAAGTC GCGCGGATCG AAAAGCAGAT GACCGATGTC GCGCACCGGC TGGGCTTCGC GGACCTCGCG GCGCTACGGG CGTCGGTCCA GACCGACCAT AAGCTGTTCG CGACCTCGCG CGAGGAAATC CTGGACCGCT ACCGCACCTA TATCGCCGGA ATGCGGCCCG AACTGCCCAG GCTGTTCGGC ACGCTGCCCA AGGCGGACGT GAAGGTGCAG GCGGTCGAAA CCTACCGCGA GGCCGAGGCG CCGGACGCCG AATACCACCA GGGTACGCCC GACGGGTCGC GCCCCGGCAT CGTGTTCGTC AATACCGGGG ATTTCGCGCA TCGCGACCTG TACACGATCG AGGACACCGC CTATCACGAG GGCATTCCCG GCCACCATCT GCAGATCGCC CTGGCCCAGA CCCTGCCCCT GCCGCCTTTC CGCCAGCAGG GCAGCTATAA TTCCTATGTC GAGGGATGGG CGCTGTACGC GGAACGGCTG GCGAAGGATT ACGGGTTCTA CAAGGACGCC TATAACGAAT ATGGCTGGCT GAACGGCGAA CTGCTGCGCG CCGACCGGCT GGTTCTGGAT ACCGGCGTGC ACTATCTGCA CTGGACGCGC CCGCAGATGA TCGCGTTCTT CCGCGCCCAT CCATCGGAAA GCGAACCCGG CATGCAGGCC GAAACCGACC GCTACATCGC CTGGCCAGGC CAGGCGCTGG GCTACAAGCT GGGCCAACTC GAAATCCTGG ATCTTCGCGC CAGGGCGCAG AAAGCGCTGG GCCCCAAATT CGATATCCGC GCCTTCCACG ACGAAATCCT GGGCGGCGGC GCACTGCCGC TGGACATGCT GGACCAGCGA GTTACGCAAT GGATCGCGGC CCAACGCGCC 
GCCCTGAAGC CGTCATGA
Protein sequence | MPTRSPFGTI LRSVNRKGTF LKRLLLASIA LFPGLAHAAE DRNASFQALL AQQWEWQLRE SPELATTIGD DRYNDRWSDM SLAHVAVQDR ALKDFQTRFT AFGTAGLSEQ NRLSRQMMLQ QLAIDREAIR LKIYEMPIDQ INGVPLQLAG FVSSIPFDSV RHYQDYIARL HAIPAVLQAV VDSARAGMAD HLMPPRFLLE KAIPQLRGIA SPAGTDNVFG QPALHFPATI PADQQAALRA QIIAAVDREV RPAFEKTADF VEKDYAPHGR AQDGIWALPD GTARYRFAIR QQTTSTMTPA AIHALGLAEV ARIEKQMTDV AHRLGFADLA ALRASVQTDH KLFATSREEI LDRYRTYIAG MRPELPRLFG TLPKADVKVQ AVETYREAEA PDAEYHQGTP DGSRPGIVFV NTGDFAHRDL YTIEDTAYHE GIPGHHLQIA LAQTLPLPPF RQQGSYNSYV EGWALYAERL AKDYGFYKDA YNEYGWLNGE LLRADRLVLD TGVHYLHWTR PQMIAFFRAH PSESEPGMQA ETDRYIAWPG QALGYKLGQL EILDLRARAQ KALGPKFDIR AFHDEILGGG ALPLDMLDQR VTQWIAAQRA ALKPS
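The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted start codons and is read as methionine at the initiation position. The following is a minimal sketch of that rule, not the database's own pipeline: the names `CODON_TABLE`, `TABLE_11_STARTS` and `translate_cds` are illustrative, and only the first 30 and last 18 nucleotides of the record are used as input.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base, then second, then third, each over TCAG.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a DNA fragment, treating a table 11 start codon at position 0 as Met."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any incomplete trailing codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in TABLE_11_STARTS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":              # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("TTGCCAACAC GTTCCCCCTT CGGCACGATC"))  # MPTRSPFGTI
# Last 18 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("GCCCTGAAGC CGTCATGA"))                # ALKPS
```

Both fragments reproduce the corresponding ends of the listed protein sequence (MPTRSPFGTI at the N-terminus, ALKPS at the C-terminus).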