Gene Xaut_3411 details


Gene Information

Locus tag: Xaut_3411
Symbol:
ID: 5423607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xanthobacter autotrophicus Py2
Kingdom: Bacteria
Replicon accession: NC_009720
Strand:
Start bp: 3802405
End bp: 3803985
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 62%
IMG OID: 640882664
Product: sulfatase
Protein accession: YP_001418297
Protein GI: 154247339
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
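
The length fields above follow from the coordinates. The sketch below is a consistency check only. It assumes the start and end positions are 1-based and inclusive, which matches the reported gene length, and that the CDS ends in a single stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3802405
end_bp = 3803985

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS with one terminal stop codon,
# so the protein has one residue per codon minus the stop.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1581 526
```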


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.420036
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGTATCC GAGGATTACT GGGAGCATTC ATGCTCACGG CGACCGCGAC GTTGACAGCC 
GTCACGCCGG CAGCCGCACA GCAGCAACCG ACCTCCAAGC CCAACATTCT CGTCATCTTC
GGTGACGATA TCGGGCAAAC CAATCTGTCG ACCTACAGCT TCGGCCTGAT GGGCTATCGC
ACGCCGAACA TCGACAGGAT CGCCAACGAG GGCCTGAAGT TCACCGACTA TTATGCCGAG
CAGAGCTGCA CGGCGGGCCG CTCGACCTTC ATCACCGGCC AGTCGACCCT GCGTACGGGC
CTGTCAAAGG TGGGCCTGCC CGGCGCCGAT CTCGGCCTTC AGGCCAGCGA CGTCACCATG
GCCTCCGCGC TGAAGGACCT CGGCTACGCC ACCGGCCAGT TCGGCAAGAA CCACCTCGGC
GACCGCGACG AATTCCTGCC GACCGCGCAC GGGTTCGACG AATTCATGGG CAACCTCTAC
CACCTCAATG CGGAGGAGGA GCCGGAGAAT TTCAACTATC CGCAGGATCC CGCCTTCCGC
AAGCAGTTCG GCCCGCGCGG CGTCATCAAG AGCTCGGCCG ACGGCAAGAT CGAGGACACC
GGCCCGCTGA CGCGCAAGCG CATGGAGACG GTGGACGACG AGACCTCCAA GGCCGCCATC
GACTTCATCG ACCGACAGGC GGCGGCCAAG AAGCCCTTCT TCGTGTGGAT GAACACCACG
CGGATGCATT TCCGCACTCA TGTCCGCGCT GAAAACCGCA GCAAGCCCGG TCTCACCGCG
CTGACCGAAT ATGCCGACGG CATGATCGAG ACCGACAAGG TGATCGGCAC GATCCTCGAC
AAGATCGACC AGCTCAAGCT GGCCGACAAC ACCATCGTCA TCTACACCAC CGACAACGGC
CCCCACCAGA ATTCCTGGCC GGATGCGGGC ACCACGCCAT TCCGCAGCGA GAAGAACACC
AATTGGGAAG GCGCATTCCG CGTTCCGGCC CTGATCCGCT GGCCGGGACA TATCCAGCCG
GGTTCGGTCG CGAACGGCAT CTTCTCCGGC CTCGACTGGT TCCCCACCCT GCTCGCCGCG
GCGGGAGACA CGACCATCAA GGAACGTCTC CTCAAGGGCA CGACCATTGC CGGCAAGCAG
TACAAGAACC ATCTCGACGG CTATAACCAG CTCGACTATC TCACCGGAAA GAGCGACAAG
AGCGCCCGCA AGGAGTTCAT CTACTTCAAC GACGACGGCC AGATCGTGGC CATGCGCTAC
GAGAACTGGA AGCTGGTCTT CTCTGAACAG CGCGCGACAG GCACGCTGCG CGTCTGGGCG
GAGCCGTTCA CGCAGCTGCG TCTTCCCAAG ATGTTCGACC TGCGTTCCGA TCCCTATGAG
CGGGCCGACC TCACATCCAA CACCTATTAC GACTGGATGC TCGACCGCGC CTACCTGGTC
GTGCCGGCCC AAGCTGGGGT CGCGAAGTTC CTGGGCACCT TCAAGGAGTT TCCGCCAGCG
CAGCGCCCGG CGAGCTTCTC GATCGATCAG ATCCAGAGCC AGCTCGAAGA GCAGTTCAAG
AACGTAGCCG GGGGCCAGTA G
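
A GC content figure like the one reported above can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. The helper name `gc_percent` is hypothetical, and it assumes the sequence is pasted as plain text in 10-base blocks separated by whitespace, as shown here.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first 10-base block of the gene sequence only.
print(gc_percent("ATGTGTATCC"))  # 40.0
```

Passing the full gene sequence text to the same function gives the value for the whole gene.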
 
Protein sequence
MCIRGLLGAF MLTATATLTA VTPAAAQQQP TSKPNILVIF GDDIGQTNLS TYSFGLMGYR 
TPNIDRIANE GLKFTDYYAE QSCTAGRSTF ITGQSTLRTG LSKVGLPGAD LGLQASDVTM
ASALKDLGYA TGQFGKNHLG DRDEFLPTAH GFDEFMGNLY HLNAEEEPEN FNYPQDPAFR
KQFGPRGVIK SSADGKIEDT GPLTRKRMET VDDETSKAAI DFIDRQAAAK KPFFVWMNTT
RMHFRTHVRA ENRSKPGLTA LTEYADGMIE TDKVIGTILD KIDQLKLADN TIVIYTTDNG
PHQNSWPDAG TTPFRSEKNT NWEGAFRVPA LIRWPGHIQP GSVANGIFSG LDWFPTLLAA
AGDTTIKERL LKGTTIAGKQ YKNHLDGYNQ LDYLTGKSDK SARKEFIYFN DDGQIVAMRY
ENWKLVFSEQ RATGTLRVWA EPFTQLRLPK MFDLRSDPYE RADLTSNTYY DWMLDRAYLV
VPAQAGVAKF LGTFKEFPPA QRPASFSIDQ IQSQLEEQFK NVAGGQ
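
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table by hand as an illustration. The names `CODON_TABLE` and `translate` are hypothetical. The amino acid assignments of table 11 are the same as those of the standard code. Table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGTGTATCC GAGGATTACT GGGAGCATTC"))  # MCIRGLLGAF
```

Translating the final line of the gene sequence, which is in frame, gives NVAGGQ followed by the stop codon TAG, matching the end of the protein sequence.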