Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0646 |
Symbol | |
ID | 6974043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 734894 |
End bp | 736363 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643390177 |
Product | sulfatase |
Protein accession | YP_002275053 |
Protein GI | 209542824 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.69317 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCTT GGGCCATTGC TGTCGTATCG GTTCTCGGGC TCGTCACCCT TTTCGTGCTT CGGTTCACGA GTGGCAAGCT GACCCGATCG TTTCTCAACG ATATCCCGGA CATCGTGATC GGCATCACGC TGTGGCTGGC GCTGACATTG CTGACCCAGC GGCCGGCCTC ATCGGCCCTG CTGTGGACGG TTCTGGCCGC GGGCCTGCTG CTGGCCGACC ACACCAAACG CCAGGTCCTT CGGGAACCCG TTGTCTTTGC CGATGCCAGC GAACTCTTTC TGGTCTTCAC GCACCCGCGC TTCTACCTGC CCTACGTCCA TCCCGCCGTC TTCTATGGAA TCGGCGGAAC GCTGGTCCTG GCGACGGCGT GGCTGTTTTC GGTCGAACCG CCCGCCATGC CGGCCCATCC CGTACTGTCC CGCGTGGCGG GGGCCCTGCT GCTGGCCGTT CCGATCGGTG CCTATTTCTG GCGCCCGTCG CTGAACATCA TGGCAAAAGT CCTGCGCAAG TTCGGACCGA GCGGCGATCC GGTGCTGGAC GCGCAAAAAC TGGGCATGCT GGGCAGCTTC GCCGCCCATA CGGTCTGCTC GCGCCAGGAA CGGCCTGAAC GCCAGGCCGC CTGCAGCTTC GGCATCGTGA CCGTGCCGGC GGACAAGCCG CCGGTGGTCC TGGTCCAGGC CGAATCCTTC TTCGATATCG GCCGGCTGGA CCCCACCCAG CCGTCCCCCC TGACGGAGTA TATGGCCTGC CGGGACCGCG CCTGGCGCCA TGGCCACCTG AATGTCGATT CATGGGGGGC CAACACGACA CGCAGCGAAT TCGCGGTCAT CAGCGGCGCC TCCCCATCCG ACCTGGGACT GGATCGCTTC AACCCCTATT ATTCCTTCGC CCGCCGTCCC CTGGACACCC TGGCCGCGCG CATGCGGCAG GCGGGCTACC TGACGGTCTG CGTCCATCCC TACGACCGCC GGTTCTACGG CCGGCACAAG GTCATGCCCA ATCTCGGCTT CGACCAGTTC ATCGGCGGCG AGGCCTTCGA CGCCCCGCAC GGCCACCTCG TCCCCGACGA GGTCCTGGGC GCCTGGATCA ACGACTTCGT CGACCGCCAG ACCCAACCCG TCTTCGTCTT CGCCATCACA GTCGCCAACC ACGGCCCCTG GCCAACCCAG GCCACAACCC CCGGCCCCTT TGCCCCCAAA CTCGGCGGCT ACCTGGACAG CCTGATGGCC ACCGACCGAA TGATCGGCCA ACTGGCGTCC TCCCGCTGGC TGAACGATGA CGGCGGCATT TTTGCCCTAT ACGGCGACCA CCAGCCCAGC CTGCCCATGC TGAATAATGG CACCTTCGAT ATCGGCACCT CGACCGATTA TTTCATCCTG GACCGCACAC GGCAGGCCGG CCATCGCCGC GACATCGGTG TCCATCAGTT GGGCCGTAAC ATTGCGGAAT GCCTGATGGA ATGCCCCTAA
|
Protein sequence | MQAWAIAVVS VLGLVTLFVL RFTSGKLTRS FLNDIPDIVI GITLWLALTL LTQRPASSAL LWTVLAAGLL LADHTKRQVL REPVVFADAS ELFLVFTHPR FYLPYVHPAV FYGIGGTLVL ATAWLFSVEP PAMPAHPVLS RVAGALLLAV PIGAYFWRPS LNIMAKVLRK FGPSGDPVLD AQKLGMLGSF AAHTVCSRQE RPERQAACSF GIVTVPADKP PVVLVQAESF FDIGRLDPTQ PSPLTEYMAC RDRAWRHGHL NVDSWGANTT RSEFAVISGA SPSDLGLDRF NPYYSFARRP LDTLAARMRQ AGYLTVCVHP YDRRFYGRHK VMPNLGFDQF IGGEAFDAPH GHLVPDEVLG AWINDFVDRQ TQPVFVFAIT VANHGPWPTQ ATTPGPFAPK LGGYLDSLMA TDRMIGQLAS SRWLNDDGGI FALYGDHQPS LPMLNNGTFD IGTSTDYFIL DRTRQAGHRR DIGVHQLGRN IAECLMECP
|
| |