Gene Information |
Locus tag | Xaut_2939 |
Symbol | |
ID | 5421821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | + |
Start bp | 3263331 |
End bp | 3265001 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640882190 |
Product | sulfatase |
Protein accession | YP_001417832 |
Protein GI | 154246874 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
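
The coordinate and length fields above are mutually consistent, and this can be checked with a little arithmetic. The Python sketch below assumes 1-based, inclusive coordinates (an assumption, although it is the reading under which the listed gene length matches) and assumes the final codon is a stop codon that is not counted in the protein length. The function and variable names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 3263331
END_BP = 3265001

def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1

def protein_length(gene_len: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1671 556
```

Both results agree with the Gene Length (1671 bp) and Protein Length (556 aa) fields.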
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACAG CAATCTCGAC CACGCTGGCC GCCGCGGTGG CGGCAACGCT CATCGTTTGG GGCCCCGCAG CCCAGGCCCA AGGCACCGCG CCTGCCGCAG CCCAGGCCAA GAAGCCGAAC ATCCTGCTGA TCGTCTCCGA CGATACCGGC TGGGGCGACC TCGGCGCCTT TCTCGGCGGC GCGCGCCGCG GCATGCCGAC GCCCAACATG GACGAACTGG CGGCCGAGGG CATGGTGTTC ACCAACTTCT ACGGCCAGCC GAGCTGCACG CCCGGCCGGG CCGCGATCCA GACCGGGCGC ATCCCCAACC GCTCCGGCAT GACCACCGTG GCCTTCCAGG GCCAGGGCGG CGGCCTGCCG AAGGCCGAAT GGACACTGGC CTCCGTGCTC AAGCAGGGCG GCTATAAGAC CTATTTCACC GGGAAATGGC ATCTCGGCGA GGCGGACTAT GCGCTGCCCA ACGCCCAGGG CTATGACGTG ATGAAGTACG CCTTCCTCTA CCACCTGAAC GCCTACACCT ATCCGGATCC CAAATGGTTC CCCGGCATGA GCCCCGATTT GCGGCACATG TTCGATACCG TCACGAAGGG CGCGCTATCC GGCAACGCCG GCGGCCCGGT GAAGGAAGAC TTCAAGGTCA ACGGCGAGTA TGTGGACACG CCCGACAAGG GCGTGGTCGG CATTCCCTTC CTTGACGCCT ACGTCGAGAA GGCCGCGGTC GAGTTTCTGG AGGAGGCCGC CAAGGCGCCC AACACGCCGT TCTTCATCAA TGTCAACTTC ATGAAGAACC ACCAGCCGAA CCTGCCGGCG CCCGAATTCG TGGGCAAGTC CCTTTCCAAG TCCAAGTATG CGGATTCGGT GGTGGAGCTG GACGCGCGGA TCGGCAACGT GCTGAAGAAG CTCGACCAGC TCGGCCTCGC CGACAACACG CTGGTGGTCT ACACCGTGGA TAACGGCGCC TGGCAGGACG TCTATCCCGA TGCCGGCTAC ACCCCGTTCC GCGGCACCAA GGGCACCGTG CGCGAAGGCG GCAACCTCGT CCCGTCCATG GCCCGCTGGC CGGGCAAGAT CAAGGCCGCG ACCAAGAGCG ACGACATCAT GGGCGGCCTG GACCTGATGG CCACCTTCGC CTCCGTCGCC GGCGTGAAGC TGCCGGAGAA GGACCGCGAG GGACAGCCCA TCATCTTCGA CAGCTACGAC ATGACTCCGG TCTGGCTCGG CAAGGGTGCG GATCTGCGGC ACGAGTGGTT CTACTTCACC GAGAACGAGC TCACCCCCGG GGCCGTGCGG GTCGACAACT TCAAATATGT CTTCAACCTG CGCGGCGATA ACGGCGCCTA TACGGGCGGC CTCGCAGTGG ACACGAACCA GGGCTGGAAG GGGCCGGAGA AGTACGTGGC GACCGTGCCG CAGGTGTTCG ACCTGCCGGC CGATCCGCAG GAGCGCTACG ACATCTTCAT GACCACCTTC ACCGAGAGCA CCTGGGCGGC GATCCCGTTC AACGTGGCGG TCGAGAAGCT CATGAAGACC TATGTGCAAT ATCCGCCCCG CAAGGCCCAG AGCGAGGCCT ATTCTGGCCC GATCACCCTG TCCCAGTATG AGCGCTTCAA GTTCGTGCAG GATGCGCTGA AGGAACAAGG TTTCAAGCTG CCCCTGCCCA CCGGCAACTG A
|
Protein sequence | MRTAISTTLA AAVAATLIVW GPAAQAQGTA PAAAQAKKPN ILLIVSDDTG WGDLGAFLGG ARRGMPTPNM DELAAEGMVF TNFYGQPSCT PGRAAIQTGR IPNRSGMTTV AFQGQGGGLP KAEWTLASVL KQGGYKTYFT GKWHLGEADY ALPNAQGYDV MKYAFLYHLN AYTYPDPKWF PGMSPDLRHM FDTVTKGALS GNAGGPVKED FKVNGEYVDT PDKGVVGIPF LDAYVEKAAV EFLEEAAKAP NTPFFINVNF MKNHQPNLPA PEFVGKSLSK SKYADSVVEL DARIGNVLKK LDQLGLADNT LVVYTVDNGA WQDVYPDAGY TPFRGTKGTV REGGNLVPSM ARWPGKIKAA TKSDDIMGGL DLMATFASVA GVKLPEKDRE GQPIIFDSYD MTPVWLGKGA DLRHEWFYFT ENELTPGAVR VDNFKYVFNL RGDNGAYTGG LAVDTNQGWK GPEKYVATVP QVFDLPADPQ ERYDIFMTTF TESTWAAIPF NVAVEKLMKT YVQYPPRKAQ SEAYSGPITL SQYERFKFVQ DALKEQGFKL PLPTGN
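
The protein sequence follows from the gene sequence by codon-wise translation. The Python sketch below builds the codon table from the standard NCBI ordering and translates two short excerpts of the gene sequence listed above. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may serve as start codons. This sketch does not handle non-ATG start codons, which is harmless here because the gene begins with ATG. The names `translate` and `CODON_TABLE` are illustrative.

```python
# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First 30 and last 12 bases of the gene sequence listed above.
first_30 = "ATGAGAACAG CAATCTCGAC CACGCTGGCC"
last_12 = "ACCGGCAACTGA"
print(translate(first_30))  # MRTAISTTLA
print(translate(last_12))   # TGN
```

The two outputs match the first ten and last three residues of the listed protein sequence. The final TGA codon is the stop.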