Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_25880 |
Symbol | |
ID | 7761499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2643063 |
End bp | 2645216 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643805469 |
Product | Sulfatase protein |
Protein accession | YP_002799742 |
Protein GI | 226944669 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTCGA GCCGCCCTGC CGGACTCGCC TCTTTGCAAA AGCGCCGGGC GTCGGCCCGG CAATCTCAGG AATCGCCCCC GGGGACGCGG GACGAGGCAA GTTTCCAGCA GTCTGCCGCT CGAACCTCCT GCAGTGTCCG CCAGCACCTG GCCTTCACCC TGGCCAGCGC CGCGCTGCTG ATGCTTCTGT ATTCCCTGCT GCGCCTGGCC CTGCTGATCT ACAACCGCGA GCAGATCGGC GCGGCGTCCG CCGCGACCCT GGCCGAGGCG TTCTTCAACG GTTCGCGCTT CGATCTACGG CTGGTCGTCT ATATCTGTGC GCCCCTGGTG CTGGCCGTGC TCAGTCGGCG GGCGATGCGA GCGAGGACGC CGTTGCGTGT CTGGCTGATC TCGTCCGCCA GCCTCACCCT GTTCCTCGGC ATGCTGGAGC TGGACTTCTA CCGCGAGTTC CACCAGCGCC TGAACGACCT GGTATTCCAG TACCTGAAGG AAGATCCGCG AACCGTCCTG AGCATGCTCT GGCACGGTTT CCCGGTTTTC CGTTATCTGC TCGCCTGGTT CCTGGCGACG CTGGCGCTGG GCTGGATGTT CAAGCGCCTG GATCTGCTCA CCCGCTCGCC GGCCGGGACT GGCCGGGGCG AGGGCGGTCG CTGGCCGATC CGGGTTCTGG CCTTCTCGCT CTGTCTGGCG TTCACGGTGC TCGTGGCGCG CGGAACCCTG CGCCAGGGGC CGCCGTTGAA GTGGGGCGAT GCTTTCACCA CCGACTCGAT GTTCGCCAAC CGGCTCGGCC TCAACGGCAC CCTGAGCCTG GCGGATGCGC TGCGCAACCG GCTCTCCGAT CGACGCGACA ATCTCTGGAA GGCCGAAATG GCGGATGGCG AGGCGCTGCG GACCGTACGG CAGATGTTGC TGACCGGCGC CGACCGGCTG GTGGATGGGG ACAGGGCGCC CGTGCGCCGC GATTATCGGC CGTCGCCCGG CGGTACCCTG CCGGTTCGCA ACGTGGTGGT GATCCTGATG GAAAGCTTCG CCGGCCACTA TGTCGGCGCC CTGGGCGCGC CGGGCGGCAT TACGCCGAAC TTCGACCGGC TGGCCGGGGA GGGGCTGCTG TTCACCCGCT TCTTCTCCAA CGGTACCCAT ACCCACCAGG GCATGTTCGC CACCATGGCC TGCTTCCCCA ATCTGCCCGG CTTCGAATAC CTGATGGAGA CGCCCGAGGG CGGCCATCGG TTCTCGGGCT TGCCGCAATT GCTCGGCGCC CGTGGCTACG ACAGCCTGTA CGTCTACAAC GGCGATTTCG CCTGGGACAA CCAGTCGGGT TTCTTCGGCA GTCAGGGGAT GAAGAACTTC ATCGGTCGGA ACGACTTCGT CGATCCGGTG TTTTCCGACC CGACCTGGGG CGTTTCCGAC CAGGACATGT TCGACCGCGC CGCCCAGGAG CTGGAGCGCC GGAGCGAGGA CGGGAAACCG TTCTACGCCC TGCTGCAGAC GCTCTCCAAC CACACGCCCT ATGCCTTGCC GGAGCACCTG CCGATGGCGC CGGTGAGCGG TTTCGGCGAA CTGGACCAGC GCCTGACCGC CATGCGCTAT TCGGACTGGG CGCTCGGGCG CTTCTTCGAC AGGGTCCGCC ACGCGCCTTA TTTCGAGGAC ACCCTGTTCG TGGTGGTCGG CGACCACGGC TTCGGCAGCC GCGAGCAACT CACCGAGCTG GACCTGCTGC GCTTCAACGT GCCCCTGCTG CTGATCGGTC CGGGCGTGCG GGAGAAGTTC GGCGCCCGCC GCGACATCGT CGGCACCCAG GTCGACGTGG TACCGACCAT CATGGGCCGG CTGGGCGGCG AGGTGCGTCA CCAGTGCTGG GGACGCGACC TGCTCGCTCA GCCGGCGGGA AGCCCGGGGT TCGGCGTCAT CAAGCCCTCG GGCGGCGACC GGAGCGTCGC CCTGGTCAGT GGCGACCGGG TGCTGGTGCA GCCACCGGGG CGGGCGGCGA AGGTCTATCG CTACCGCCTC GGCAGCGATC CCGCCAGTCT GCCGCTCGCC GAGGTGCCCG ACGAGGCACT GCTGAAGCGG CAACTGGGCG CCTTCCTGCA GGCGGCCACC GCCAGTCTGC TGGACGATAC GGCCGGAGCG GACGACGGCG GGACGGAGCG CTGA
|
Protein sequence | MFSSRPAGLA SLQKRRASAR QSQESPPGTR DEASFQQSAA RTSCSVRQHL AFTLASAALL MLLYSLLRLA LLIYNREQIG AASAATLAEA FFNGSRFDLR LVVYICAPLV LAVLSRRAMR ARTPLRVWLI SSASLTLFLG MLELDFYREF HQRLNDLVFQ YLKEDPRTVL SMLWHGFPVF RYLLAWFLAT LALGWMFKRL DLLTRSPAGT GRGEGGRWPI RVLAFSLCLA FTVLVARGTL RQGPPLKWGD AFTTDSMFAN RLGLNGTLSL ADALRNRLSD RRDNLWKAEM ADGEALRTVR QMLLTGADRL VDGDRAPVRR DYRPSPGGTL PVRNVVVILM ESFAGHYVGA LGAPGGITPN FDRLAGEGLL FTRFFSNGTH THQGMFATMA CFPNLPGFEY LMETPEGGHR FSGLPQLLGA RGYDSLYVYN GDFAWDNQSG FFGSQGMKNF IGRNDFVDPV FSDPTWGVSD QDMFDRAAQE LERRSEDGKP FYALLQTLSN HTPYALPEHL PMAPVSGFGE LDQRLTAMRY SDWALGRFFD RVRHAPYFED TLFVVVGDHG FGSREQLTEL DLLRFNVPLL LIGPGVREKF GARRDIVGTQ VDVVPTIMGR LGGEVRHQCW GRDLLAQPAG SPGFGVIKPS GGDRSVALVS GDRVLVQPPG RAAKVYRYRL GSDPASLPLA EVPDEALLKR QLGAFLQAAT ASLLDDTAGA DDGGTER
|
| |