Gene Xaut_2420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_2420 
Symbol 
ID5423780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp2701931 
End bp2704453 
Gene Length2523 bp 
Protein Length840 aa 
Translation table11 
GC content65% 
IMG OID640881673 
Productsulfatase 
Protein accessionYP_001417319 
Protein GI154246361 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.73372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.229753 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCA TCTTTGCGCT GGCCCTGACG GCCAGCCTCA CCGTGCTCGG CGTGATCCCC 
GCCGCCGCGC CCGCCGGCGC ACAGAAGATC AACGGCGTGC CCGGCGCCCC GGATGCCAGC
ATCGTCATCG ACGGCAGCCA GATCCCGGCG CCGCCGCAGA AGTTCGAGGG CGTGATCAAG
GAGCTGGCGC AGGACTCCAA GCCGTACTGG CCGGCGCGGA TCGTGCCGCC TACCGGCGCG
CCCAACGTCC TGCTCATCAT GACCGACGAC GTGGGCTTCG GCGCGCCGTC CACCTTCGGC
GGCGTCATTC CCACGCCCAG CCTCGACCGC ATCGCCAATG ACGGGCTGCG CTACACCAAT
TTCCACTCCA CCGCCCTGTG CTCGCCCACC CGCGCGGCGC TGATCACCGG ACGCAACCAC
CATTCCGCCG GCTTCGCAGT GGTGTCCGAG ATGGCCACGG GCTATCCGGG CTACGACAGC
ATCATCACCA AGGACAAGGC CACCATCGGC CGGATCCTGA AGGACAACGG CTACCGCACC
TCGTGGTTCG GGAAGAACCA CAACACGCCG TCGTTCCAGG CCATCTCCAC CGGTCCGTTC
GACCAGTGGC CGACGGGCAT GGGGTTTGAG TATTTCTACG GCTTCATGGG TGGGGACACC
AGCCAGTGGC AGCCGGACAA CCTGGCCCGC AACACCACCT ATATCTATCC GTTCCAGGGC
AATCCTTCGT TCAACCTGAC CACGGCCATG GCGGACGAAG CCATCGCCTA CATGAACAAG
ATCAACACCC TGACGCCGGA CCAGCCGTTC TTCGTCTATT ACGTGCCCGG CGGCACCCAC
GCCCCGCACC ACCCCACCCC CGAATGGATC GAGAAGGCGA CGCAGCTCCA CCTGTTCGAC
AAGGGCTGGA ACGCGCTGCG CGAGCAGATC TTCGAGAACC AGAAGAAGCT CGGCGTCATC
CCCCAGAACG CGAAGATGAC GCCCTGGCCG GACGACCTGC TGAAGCGCTG GGACAAGCTC
ACCGACGACG AACAGAAGAT GTTCAAGCGC CAGGTTGATG TCTACGCCGC CTACCTGATG
TATACGGACC ATGAGATCGG CCGCGTCATC CAGGCTGTCG AGGACATGGG CAAGCTGGAC
AACACGCTGG TCATCTATAT CAGCGGCGAC AACGGCTCCA GCGCCGAAGG CACGCTCATC
GGCACCCCCA ACGAAGTCGC CATGTTCAAC GGCGTCGATG TGCCGGTCGC AGACCAGCTG
AAGTATTTCT ATGATGTCTG GGGCTCGGAC AAGACCTACA ACCACATGGC GGTGGGCTGG
ACCTGGGCCT TTTCCACCCC CTTCTCCTGG ACCAAGCAGG TCGCCTCCCA TTTCGGCGGC
ACCCGGCAGG GCATGGCCAT CTCCTGGCCC AAGGTGATCA AGGACAAGGG CGGCATCCGC
TCCCAGTTCC ACCATGTGAT CGACATCGCG CCGACCATCC TGGAGGCCAC CGGCATCAAG
GCTCCCGATA TGGTGGACGG CATCAAGCAG GCTCCCATCG AGGGGGTCAG CATGACCTAT
ACCTTCGACA AGGCCAATGC GACCGCACCC TCCACCCACA AGACCCAGTA TTTCGAGATG
ATGGGCGACC ATGCCATCTA TAACGACGGC TGGATGCTGA GCAGCAAGGT GGTGCGTCCG
CCGTGGGAGG TCCAGGCCGG CCTCGGCCTC GACCCGTCCA AGTACCCATG GGAACTCTAC
AATATTTCGG AAGACTGGAC CCAGTACGAG GACGTTGCCG CCAAGCATCC CGACAAGGTG
AAGGCGATGG CGGACCTGTT CTGGTCCGAA GCCAGGAAGT ACCAGGTGCT CCCGCTGGAC
GCCACCGTCG CCACCCGGCT GGTGACGCCG CGCCCGAGCA TCACCGCGGG TCGCGACGTG
TTCACCTGGA CCGCTCCGCT CACCGGCACG CCCAATGGCG ACGCGCCGTC GGTGCTGAAC
ACCTCCTACC GCTTCACCGC GGACGTGGTG GTGCCCGAGG GCGGCGGCGA CGGCATGCTC
ATCACCCAGG GCGGGCGGTT TGCCGGCTAT GGCTTCTACC TGCTCAAGGG CAAGCCCACC
TTCACCTGGA ACCTCGTGGG CCTGAAGAAG GTGAAATGGC AGGGCACCGA GCCGCTGGCA
CCGGGCAAGC ACACCCTGGT TTTCGACTTC AAGTATGACG GCCTCGGCGC GGCGACGCTG
GCCTTCGGCT CGGGCAGCGG GCTCGGCCAG AGCGGCACCG GGACGCTGAG CGTGGACGGC
AAGGTGGTGG CGACCCAGTC CATGCCCCAC ACCATCCCGC TCATCCTCGC CTGGGACGAG
AATCTGGACG TGGGCTCCGA CACCGGCACG CCGGTGGACG ATGCCGATTA CCAGGTGCCG
TTCGCCTTCA CCGGCAAGAT CAACAAGATC ACGCTCGCGC TTGATCATCC GAAGCTGACG
CCGGAGGATA TCGCCAAGCT GCGCGACGCC GCTGCCAAGG CTGCCGACGG CCCGTCGAAA
TAG
 
Protein sequence
MSRIFALALT ASLTVLGVIP AAAPAGAQKI NGVPGAPDAS IVIDGSQIPA PPQKFEGVIK 
ELAQDSKPYW PARIVPPTGA PNVLLIMTDD VGFGAPSTFG GVIPTPSLDR IANDGLRYTN
FHSTALCSPT RAALITGRNH HSAGFAVVSE MATGYPGYDS IITKDKATIG RILKDNGYRT
SWFGKNHNTP SFQAISTGPF DQWPTGMGFE YFYGFMGGDT SQWQPDNLAR NTTYIYPFQG
NPSFNLTTAM ADEAIAYMNK INTLTPDQPF FVYYVPGGTH APHHPTPEWI EKATQLHLFD
KGWNALREQI FENQKKLGVI PQNAKMTPWP DDLLKRWDKL TDDEQKMFKR QVDVYAAYLM
YTDHEIGRVI QAVEDMGKLD NTLVIYISGD NGSSAEGTLI GTPNEVAMFN GVDVPVADQL
KYFYDVWGSD KTYNHMAVGW TWAFSTPFSW TKQVASHFGG TRQGMAISWP KVIKDKGGIR
SQFHHVIDIA PTILEATGIK APDMVDGIKQ APIEGVSMTY TFDKANATAP STHKTQYFEM
MGDHAIYNDG WMLSSKVVRP PWEVQAGLGL DPSKYPWELY NISEDWTQYE DVAAKHPDKV
KAMADLFWSE ARKYQVLPLD ATVATRLVTP RPSITAGRDV FTWTAPLTGT PNGDAPSVLN
TSYRFTADVV VPEGGGDGML ITQGGRFAGY GFYLLKGKPT FTWNLVGLKK VKWQGTEPLA
PGKHTLVFDF KYDGLGAATL AFGSGSGLGQ SGTGTLSVDG KVVATQSMPH TIPLILAWDE
NLDVGSDTGT PVDDADYQVP FAFTGKINKI TLALDHPKLT PEDIAKLRDA AAKAADGPSK