Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_08230 |
Symbol | |
ID | 7759777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 780269 |
End bp | 782041 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643803741 |
Product | Arylsulfotransferase |
Protein accession | YP_002798043 |
Protein GI | 226942970 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCATT TTGAACAACA TAGCTCCCCA TCGCCTTCCG CCCATCCGGA GCGCGGGGCA CAGCCCGAGC AGGCATTGCC CGAAGGCGCC TGTCTCACCG CCCGCATCCC CGACAGCGAC AACGCCAGGC TGGGCGCCAT CGTCGTCAAC CCGTATCGCC TGGCTCCGTT GACCGCCGTC ATCCGCGACG GGGGCCAGCG CATCTCCCAG GCGCGGGTGC GGGTCAAGGG ACGCGGCGAG GGAGGGGTGG ACATCGATTA CGCGGTCGAG GATCGCACTC TCTGGACTCA CGGCGGCATC CCGGTGTTCG GTCTCTATCC GGACCATCGC AACGAAGTCG AGGTGGCCTA CAGCCTGGAC GGCGAACGCA TTCGCGAGCG TTACTTCATC TATGCTCCGG CGGTGCGTCT GCCGGTGGTG GCCAGTCAGG AAAGCGCGCT GCCACGGGTC GAGCCGCTGA AGGTGGCGCC GCACCTGAAA AAGCGCCTGT ACCTGTTCAA CCACCTGCTC ACCGAGATTC CCGGTAGCAA CCGCCTGCTG AAGTGGAACG CTCCGGGCGG CGCCGCCGAG TGGGACTCGG TGGGGATCAA TTGGATCGCC GACAGCAACG GCGACGTGCG CTGGTACCTG GACATCGAGC AGATCCACGA CTCCACCCGC AAGGATGGGC TGGGGGCGAG CATGGGCTTC CAGCAGACCC GCGACGGCCA CCTGATCTGG GGCCAGGGCC AGCGCTACTA CAAGCACGAC CTGCTGGGCC GCACGATCTG GGCGCGCACC TTGCCGGAGA AGTTCGCCGA CTTCTCCCAT GAGATCCGCG AGACCGAAAA AGGTACCTAC CTGCTGCGGG TCGGTACCAG CGACTATCGT CGCGCCGACG GAAAGCGGGT GCGCTCGATC CGCGACCACA TCCTGGAAGT CGACCAGAAC GGCGATGTGG TGGACTTCTG GGACCTGAAC CGAATCCTCG ACCCCTATCG CGCCGAACTG CTGCACACCC TGGGACGGGC CGCGGTGTTG CTGCCCGAGG GCGTCGCCAG GAGTGACGAT CTGCTGGACA ACGAACGTAA CGAAGGCGAT GTCCTGCCGT TCGGTGACAC ACCCGGTGTC GGCACCGGGC GCAACTGGGC CCACGTCAAC GCCATCGAGC ATGACCCGGC CGACGACAGC ATCATCGTCT CCGCCCGTCA CCAGGGGGTG GCCAAGATCG GGCGCGACAA ACAGGTGAAA TGGCTGCTGG CCGATCCGCG AGGATGGAGC AGCGCCCTGC GGGCCAAGGT GCTCACGCCG GTAAACGCCG CTGGCGAGGT ATTGGCGCAG AATCCCAACG GCAGTTATCC CGAGGGCTTC GACTGGTCCT GGACCCAGCA CACCGCCTGG CTCAGCAGCA AGGGAACGCT CACCGTGTTC GATAACGGCT GGGGTCGCAA CCTGGCGCCC ACTCGCCTGG AAGGCAACTA CAGCCGGGCG GTGGAATACC GCATAGACGA GGAGAAGGGC ACCGTGCAGC AACTCTGGGA GTTCGGCAAG GAACGCGGCG ACGCCTGGTA CAGCCCGATC ACCTCGGTGG TGGAGTACCG CCCGGAATCC GACACCCTGC TGATCTACTC GGCGGCGATC GGGCATCTGA CGCCACAGCG GCTGACCCGG CCGGTGCTCA GCGAAGTGAA GTACGGCACC CAGGAAGTGC TCAGCGAATT CCGCGTGGTC AGCGGCCAGC CGGGCAACGT CGGTTACCGG GCGCTGGTGA TCGATCTGGA GCGGCTGTTC TGA
|
Protein sequence | MSHFEQHSSP SPSAHPERGA QPEQALPEGA CLTARIPDSD NARLGAIVVN PYRLAPLTAV IRDGGQRISQ ARVRVKGRGE GGVDIDYAVE DRTLWTHGGI PVFGLYPDHR NEVEVAYSLD GERIRERYFI YAPAVRLPVV ASQESALPRV EPLKVAPHLK KRLYLFNHLL TEIPGSNRLL KWNAPGGAAE WDSVGINWIA DSNGDVRWYL DIEQIHDSTR KDGLGASMGF QQTRDGHLIW GQGQRYYKHD LLGRTIWART LPEKFADFSH EIRETEKGTY LLRVGTSDYR RADGKRVRSI RDHILEVDQN GDVVDFWDLN RILDPYRAEL LHTLGRAAVL LPEGVARSDD LLDNERNEGD VLPFGDTPGV GTGRNWAHVN AIEHDPADDS IIVSARHQGV AKIGRDKQVK WLLADPRGWS SALRAKVLTP VNAAGEVLAQ NPNGSYPEGF DWSWTQHTAW LSSKGTLTVF DNGWGRNLAP TRLEGNYSRA VEYRIDEEKG TVQQLWEFGK ERGDAWYSPI TSVVEYRPES DTLLIYSAAI GHLTPQRLTR PVLSEVKYGT QEVLSEFRVV SGQPGNVGYR ALVIDLERLF
|
| |