Gene Information |
Locus tag | Xaut_3411 |
Symbol | |
ID | 5423607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 3802405 |
End bp | 3803985 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640882664 |
Product | sulfatase |
Protein accession | YP_001418297 |
Protein GI | 154247339 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
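The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon. Both assumptions are consistent with the values listed above, but the record does not state them. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(3802405, 3803985)
print(length)                     # 1581
print(protein_length_aa(length))  # 526
```

Both results match the Gene Length (1581 bp) and Protein Length (526 aa) fields.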
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.420036 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTATCC GAGGATTACT GGGAGCATTC ATGCTCACGG CGACCGCGAC GTTGACAGCC GTCACGCCGG CAGCCGCACA GCAGCAACCG ACCTCCAAGC CCAACATTCT CGTCATCTTC GGTGACGATA TCGGGCAAAC CAATCTGTCG ACCTACAGCT TCGGCCTGAT GGGCTATCGC ACGCCGAACA TCGACAGGAT CGCCAACGAG GGCCTGAAGT TCACCGACTA TTATGCCGAG CAGAGCTGCA CGGCGGGCCG CTCGACCTTC ATCACCGGCC AGTCGACCCT GCGTACGGGC CTGTCAAAGG TGGGCCTGCC CGGCGCCGAT CTCGGCCTTC AGGCCAGCGA CGTCACCATG GCCTCCGCGC TGAAGGACCT CGGCTACGCC ACCGGCCAGT TCGGCAAGAA CCACCTCGGC GACCGCGACG AATTCCTGCC GACCGCGCAC GGGTTCGACG AATTCATGGG CAACCTCTAC CACCTCAATG CGGAGGAGGA GCCGGAGAAT TTCAACTATC CGCAGGATCC CGCCTTCCGC AAGCAGTTCG GCCCGCGCGG CGTCATCAAG AGCTCGGCCG ACGGCAAGAT CGAGGACACC GGCCCGCTGA CGCGCAAGCG CATGGAGACG GTGGACGACG AGACCTCCAA GGCCGCCATC GACTTCATCG ACCGACAGGC GGCGGCCAAG AAGCCCTTCT TCGTGTGGAT GAACACCACG CGGATGCATT TCCGCACTCA TGTCCGCGCT GAAAACCGCA GCAAGCCCGG TCTCACCGCG CTGACCGAAT ATGCCGACGG CATGATCGAG ACCGACAAGG TGATCGGCAC GATCCTCGAC AAGATCGACC AGCTCAAGCT GGCCGACAAC ACCATCGTCA TCTACACCAC CGACAACGGC CCCCACCAGA ATTCCTGGCC GGATGCGGGC ACCACGCCAT TCCGCAGCGA GAAGAACACC AATTGGGAAG GCGCATTCCG CGTTCCGGCC CTGATCCGCT GGCCGGGACA TATCCAGCCG GGTTCGGTCG CGAACGGCAT CTTCTCCGGC CTCGACTGGT TCCCCACCCT GCTCGCCGCG GCGGGAGACA CGACCATCAA GGAACGTCTC CTCAAGGGCA CGACCATTGC CGGCAAGCAG TACAAGAACC ATCTCGACGG CTATAACCAG CTCGACTATC TCACCGGAAA GAGCGACAAG AGCGCCCGCA AGGAGTTCAT CTACTTCAAC GACGACGGCC AGATCGTGGC CATGCGCTAC GAGAACTGGA AGCTGGTCTT CTCTGAACAG CGCGCGACAG GCACGCTGCG CGTCTGGGCG GAGCCGTTCA CGCAGCTGCG TCTTCCCAAG ATGTTCGACC TGCGTTCCGA TCCCTATGAG CGGGCCGACC TCACATCCAA CACCTATTAC GACTGGATGC TCGACCGCGC CTACCTGGTC GTGCCGGCCC AAGCTGGGGT CGCGAAGTTC CTGGGCACCT TCAAGGAGTT TCCGCCAGCG CAGCGCCCGG CGAGCTTCTC GATCGATCAG ATCCAGAGCC AGCTCGAAGA GCAGTTCAAG AACGTAGCCG GGGGCCAGTA G
|
Protein sequence | MCIRGLLGAF MLTATATLTA VTPAAAQQQP TSKPNILVIF GDDIGQTNLS TYSFGLMGYR TPNIDRIANE GLKFTDYYAE QSCTAGRSTF ITGQSTLRTG LSKVGLPGAD LGLQASDVTM ASALKDLGYA TGQFGKNHLG DRDEFLPTAH GFDEFMGNLY HLNAEEEPEN FNYPQDPAFR KQFGPRGVIK SSADGKIEDT GPLTRKRMET VDDETSKAAI DFIDRQAAAK KPFFVWMNTT RMHFRTHVRA ENRSKPGLTA LTEYADGMIE TDKVIGTILD KIDQLKLADN TIVIYTTDNG PHQNSWPDAG TTPFRSEKNT NWEGAFRVPA LIRWPGHIQP GSVANGIFSG LDWFPTLLAA AGDTTIKERL LKGTTIAGKQ YKNHLDGYNQ LDYLTGKSDK SARKEFIYFN DDGQIVAMRY ENWKLVFSEQ RATGTLRVWA EPFTQLRLPK MFDLRSDPYE RADLTSNTYY DWMLDRAYLV VPAQAGVAKF LGTFKEFPPA QRPASFSIDQ IQSQLEEQFK NVAGGQ
|
| |
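The sequences above are printed in blocks of ten characters, so the spaces must be removed before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence, and how the sequence could be checked for a terminal stop codon. The function names are hypothetical, and the stop codon set assumes translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons, assuming translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy inputs for illustration, not taken from this record.
print(gc_percent("ATGC GGCC"))       # 75.0
print(ends_with_stop("ATGAAA TAG"))  # True
```

Passing the full Gene sequence field to `gc_percent` should give a value that rounds to the GC content reported above, provided that field was computed over the same sequence.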