Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xaut_2420 |
Symbol | |
ID | 5423780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 2701931 |
End bp | 2704453 |
Gene Length | 2523 bp |
Protein Length | 840 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640881673 |
Product | sulfatase |
Protein accession | YP_001417319 |
Protein GI | 154246361 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.73372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.229753 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGCA TCTTTGCGCT GGCCCTGACG GCCAGCCTCA CCGTGCTCGG CGTGATCCCC GCCGCCGCGC CCGCCGGCGC ACAGAAGATC AACGGCGTGC CCGGCGCCCC GGATGCCAGC ATCGTCATCG ACGGCAGCCA GATCCCGGCG CCGCCGCAGA AGTTCGAGGG CGTGATCAAG GAGCTGGCGC AGGACTCCAA GCCGTACTGG CCGGCGCGGA TCGTGCCGCC TACCGGCGCG CCCAACGTCC TGCTCATCAT GACCGACGAC GTGGGCTTCG GCGCGCCGTC CACCTTCGGC GGCGTCATTC CCACGCCCAG CCTCGACCGC ATCGCCAATG ACGGGCTGCG CTACACCAAT TTCCACTCCA CCGCCCTGTG CTCGCCCACC CGCGCGGCGC TGATCACCGG ACGCAACCAC CATTCCGCCG GCTTCGCAGT GGTGTCCGAG ATGGCCACGG GCTATCCGGG CTACGACAGC ATCATCACCA AGGACAAGGC CACCATCGGC CGGATCCTGA AGGACAACGG CTACCGCACC TCGTGGTTCG GGAAGAACCA CAACACGCCG TCGTTCCAGG CCATCTCCAC CGGTCCGTTC GACCAGTGGC CGACGGGCAT GGGGTTTGAG TATTTCTACG GCTTCATGGG TGGGGACACC AGCCAGTGGC AGCCGGACAA CCTGGCCCGC AACACCACCT ATATCTATCC GTTCCAGGGC AATCCTTCGT TCAACCTGAC CACGGCCATG GCGGACGAAG CCATCGCCTA CATGAACAAG ATCAACACCC TGACGCCGGA CCAGCCGTTC TTCGTCTATT ACGTGCCCGG CGGCACCCAC GCCCCGCACC ACCCCACCCC CGAATGGATC GAGAAGGCGA CGCAGCTCCA CCTGTTCGAC AAGGGCTGGA ACGCGCTGCG CGAGCAGATC TTCGAGAACC AGAAGAAGCT CGGCGTCATC CCCCAGAACG CGAAGATGAC GCCCTGGCCG GACGACCTGC TGAAGCGCTG GGACAAGCTC ACCGACGACG AACAGAAGAT GTTCAAGCGC CAGGTTGATG TCTACGCCGC CTACCTGATG TATACGGACC ATGAGATCGG CCGCGTCATC CAGGCTGTCG AGGACATGGG CAAGCTGGAC AACACGCTGG TCATCTATAT CAGCGGCGAC AACGGCTCCA GCGCCGAAGG CACGCTCATC GGCACCCCCA ACGAAGTCGC CATGTTCAAC GGCGTCGATG TGCCGGTCGC AGACCAGCTG AAGTATTTCT ATGATGTCTG GGGCTCGGAC AAGACCTACA ACCACATGGC GGTGGGCTGG ACCTGGGCCT TTTCCACCCC CTTCTCCTGG ACCAAGCAGG TCGCCTCCCA TTTCGGCGGC ACCCGGCAGG GCATGGCCAT CTCCTGGCCC AAGGTGATCA AGGACAAGGG CGGCATCCGC TCCCAGTTCC ACCATGTGAT CGACATCGCG CCGACCATCC TGGAGGCCAC CGGCATCAAG GCTCCCGATA TGGTGGACGG CATCAAGCAG GCTCCCATCG AGGGGGTCAG CATGACCTAT ACCTTCGACA AGGCCAATGC GACCGCACCC TCCACCCACA AGACCCAGTA TTTCGAGATG ATGGGCGACC ATGCCATCTA TAACGACGGC TGGATGCTGA GCAGCAAGGT GGTGCGTCCG CCGTGGGAGG TCCAGGCCGG CCTCGGCCTC GACCCGTCCA AGTACCCATG GGAACTCTAC AATATTTCGG AAGACTGGAC CCAGTACGAG GACGTTGCCG CCAAGCATCC CGACAAGGTG AAGGCGATGG CGGACCTGTT CTGGTCCGAA GCCAGGAAGT ACCAGGTGCT CCCGCTGGAC GCCACCGTCG CCACCCGGCT GGTGACGCCG CGCCCGAGCA TCACCGCGGG TCGCGACGTG TTCACCTGGA CCGCTCCGCT CACCGGCACG CCCAATGGCG ACGCGCCGTC GGTGCTGAAC ACCTCCTACC GCTTCACCGC GGACGTGGTG GTGCCCGAGG GCGGCGGCGA CGGCATGCTC ATCACCCAGG GCGGGCGGTT TGCCGGCTAT GGCTTCTACC TGCTCAAGGG CAAGCCCACC TTCACCTGGA ACCTCGTGGG CCTGAAGAAG GTGAAATGGC AGGGCACCGA GCCGCTGGCA CCGGGCAAGC ACACCCTGGT TTTCGACTTC AAGTATGACG GCCTCGGCGC GGCGACGCTG GCCTTCGGCT CGGGCAGCGG GCTCGGCCAG AGCGGCACCG GGACGCTGAG CGTGGACGGC AAGGTGGTGG CGACCCAGTC CATGCCCCAC ACCATCCCGC TCATCCTCGC CTGGGACGAG AATCTGGACG TGGGCTCCGA CACCGGCACG CCGGTGGACG ATGCCGATTA CCAGGTGCCG TTCGCCTTCA CCGGCAAGAT CAACAAGATC ACGCTCGCGC TTGATCATCC GAAGCTGACG CCGGAGGATA TCGCCAAGCT GCGCGACGCC GCTGCCAAGG CTGCCGACGG CCCGTCGAAA TAG
|
Protein sequence | MSRIFALALT ASLTVLGVIP AAAPAGAQKI NGVPGAPDAS IVIDGSQIPA PPQKFEGVIK ELAQDSKPYW PARIVPPTGA PNVLLIMTDD VGFGAPSTFG GVIPTPSLDR IANDGLRYTN FHSTALCSPT RAALITGRNH HSAGFAVVSE MATGYPGYDS IITKDKATIG RILKDNGYRT SWFGKNHNTP SFQAISTGPF DQWPTGMGFE YFYGFMGGDT SQWQPDNLAR NTTYIYPFQG NPSFNLTTAM ADEAIAYMNK INTLTPDQPF FVYYVPGGTH APHHPTPEWI EKATQLHLFD KGWNALREQI FENQKKLGVI PQNAKMTPWP DDLLKRWDKL TDDEQKMFKR QVDVYAAYLM YTDHEIGRVI QAVEDMGKLD NTLVIYISGD NGSSAEGTLI GTPNEVAMFN GVDVPVADQL KYFYDVWGSD KTYNHMAVGW TWAFSTPFSW TKQVASHFGG TRQGMAISWP KVIKDKGGIR SQFHHVIDIA PTILEATGIK APDMVDGIKQ APIEGVSMTY TFDKANATAP STHKTQYFEM MGDHAIYNDG WMLSSKVVRP PWEVQAGLGL DPSKYPWELY NISEDWTQYE DVAAKHPDKV KAMADLFWSE ARKYQVLPLD ATVATRLVTP RPSITAGRDV FTWTAPLTGT PNGDAPSVLN TSYRFTADVV VPEGGGDGML ITQGGRFAGY GFYLLKGKPT FTWNLVGLKK VKWQGTEPLA PGKHTLVFDF KYDGLGAATL AFGSGSGLGQ SGTGTLSVDG KVVATQSMPH TIPLILAWDE NLDVGSDTGT PVDDADYQVP FAFTGKINKI TLALDHPKLT PEDIAKLRDA AAKAADGPSK
|
| |