Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_10410 |
Symbol | |
ID | 7759986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 986156 |
End bp | 988345 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643803946 |
Product | Sulfatase |
Protein accession | YP_002798248 |
Protein GI | 226943175 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAGCACC CTCTGTCCTC CCTCGCCGCC CTGATCGGCT TGCTGCTCGC CGTTCCCCTG GGCCTGCGCC TGGCCCTGGG CTGGTCCGAC CCGCTCGGCT ATCTGTCGGA CCTGGGCATT GCCGGCCTGC TGGTCGTGCT TCTACAGCGC CGTCCATGGT GGCTGGCGTT GCCGGTGCTG TCGGTGTGGT GCCTGATGAC CCTGGCCTCG GTCGAGCTGG TCAGTGCGGT CGGCAGGCTG CCGAGCATGG CGGACGTCCA TTACCTGCTC GACTTGCAGT TCCTGGAAAA CTCCACTGGC GGCGGTTTCG CCCAGCCCTG GCTGGCCGCC GCCCTGGCGT CCGGCCTGGC GTTCTGGCTG ATCGCCCGCT GGGCCGGCCG TTCGAAACCC GCCGCAGGCT TGCCCCGGCA CGCCTGGACG GTTCCGGCGC TGCTCCTGCT GGTCCACGGC GGCGTGCAAT ACCGGTATCC CAGCGAGGAG GACCCGTGGC GGCTGTTCAA CCTGCCGCAC CAACTGCTGG TGGCGGGCAT CGGGGAGGGA CAGGCCCGTC TGGAGGAATG GCTGGAAGGC GACGGCGCCG ACACTCCACC GCGGATGGCG GGCCTCACCC GGCTCGACCT GGATGGACGG AAACTGCTGC CGGAGCCCGG ACGCGCACGC AATGTGCTGC TGGTCGTGCT CGAAGGCATT CCGGGCGCCT ATGTGGGGGT GAACCGGCAT GCCCTGCACA GTAGCTACCG GGAAAACCCG ATGCCGCACC TGAGCGCATG GGCCGAGCGC GGCATGAACA CGCCCGACTA CGTGCTGCAC ACCCATCAGA CCATTCGCGG CCTGTATTCG ATGCTCTGCG GCGACTACGA CAAGCTCGAC AACGGCACGC CCAAGGGGGT CGAGATGCTG GCTCTGACCC GGCGTAACCA GGACTGCCTG CCGGCCCGCC TGCGCAAGAA CGGGTTCGCC ACGCACTACC TGCAGGGCGC AGGCCTCAGG TTCATGGCCA AGGACCGGAT CATGCCGCAC ATCGGCTTCG ACACCACCCT GGGCCGGGAG TGGTTCGCCA GGCCGCCGTA CCTGGATTTC CCCTGGGGCC AGGACGACAA GGCGTTCTTC GAAGGCGCAC TGGACTATGT CGGACAACTG CGGCGGCAGG AGCGGCCCTG GATGCTCACG CTGCTGACCG TCGGTACTCA CCAGCCCTAT TCGGCCCCCG AGGAATACCT GCAGCGCCAT GACTCGCCCA AGCGGGCGGC GGTGGGCTAT CTGGACGACG CGCTCGGGCA GTTCCTGACC GAACTGGAGC GGCGGGGCGT GCTGCGGGAT ACCCTGGTCA TCGTCACTTC GGACGAATCC CATGGTATCG ACGGCGTGCG CCTGGCCTCG TCCTGGGGCT TCGCCCTGGT GCTGGCGCCG GAGCGGGAGC GTTTGCCCAG GGTGAAGTCC GGGGTCTACG GGCACGTCGA TCTGAGCGCC TCGGTACTCG ACTACTTCGC TTTCCCGGTG CCCGCCAGCC TGAGCGGCCG TTCGCTGTTC CGCGACTACG AGACGGGCCG GGAGATTATG TCGTTCACCA ACGGCAAGCT GCGCTATCAC GATGGCCGGG GAACCCTGAC CGAATGCGAT TTCCAGCAGC GCTGCCGGTA TTACGCGAGC GAAGGTTTCA TCGCCGAGCG AGCGACCTTT CTCGGCCAGT ATGGCGGCAA GCGTGCGCGA CAGATAATGT CGAGGGCCGC CGCGCTGGAC AGCGCCCTGC TGCGTACCGC GTCGAACCGA CGCTATCAGT TCGGAAGTCC GGCGAGAATC CCGCTGCGGG CGCAAGTCGA GGACGACTGG GCCGACAACC TCATCGGCGC CCAGTATCTG GAGATGCCCA AGGGATCGCA CACCCGCGTG CGCCTGACCG TCCGCGCCGT GGAGCCGCGA CAGAACGCCT ACATTCTGCT CAAGGCCAAG GAGTTCGAGC AGGACGTGCC CCTGGGCCTG CCGACGGAAA TGCTGGTCAC GCCCGAACAG CCGCTGGAGA TGGATTTCGG TTTCGACAAT CCGGAGTCGC GCAAGGCGTT CTCCTTTCAC CTGCTCGGCT ATGGGCCGGG CGCCATCGAG ATAAGCGACT TCAGCGTGAT CACCGAGTTG CCGGGACAGG CGGAATCGCT GGACGAAGTC GCGGACGACG ACGATGCCCG GTCGAGCTGA
|
Protein sequence | MQHPLSSLAA LIGLLLAVPL GLRLALGWSD PLGYLSDLGI AGLLVVLLQR RPWWLALPVL SVWCLMTLAS VELVSAVGRL PSMADVHYLL DLQFLENSTG GGFAQPWLAA ALASGLAFWL IARWAGRSKP AAGLPRHAWT VPALLLLVHG GVQYRYPSEE DPWRLFNLPH QLLVAGIGEG QARLEEWLEG DGADTPPRMA GLTRLDLDGR KLLPEPGRAR NVLLVVLEGI PGAYVGVNRH ALHSSYRENP MPHLSAWAER GMNTPDYVLH THQTIRGLYS MLCGDYDKLD NGTPKGVEML ALTRRNQDCL PARLRKNGFA THYLQGAGLR FMAKDRIMPH IGFDTTLGRE WFARPPYLDF PWGQDDKAFF EGALDYVGQL RRQERPWMLT LLTVGTHQPY SAPEEYLQRH DSPKRAAVGY LDDALGQFLT ELERRGVLRD TLVIVTSDES HGIDGVRLAS SWGFALVLAP ERERLPRVKS GVYGHVDLSA SVLDYFAFPV PASLSGRSLF RDYETGREIM SFTNGKLRYH DGRGTLTECD FQQRCRYYAS EGFIAERATF LGQYGGKRAR QIMSRAAALD SALLRTASNR RYQFGSPARI PLRAQVEDDW ADNLIGAQYL EMPKGSHTRV RLTVRAVEPR QNAYILLKAK EFEQDVPLGL PTEMLVTPEQ PLEMDFGFDN PESRKAFSFH LLGYGPGAIE ISDFSVITEL PGQAESLDEV ADDDDARSS
|
| |