Gene Spea_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpea_2007 
Symbol 
ID5662400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella pealeana ATCC 700345 
KingdomBacteria 
Replicon accessionNC_009901 
Strand
Start bp2431160 
End bp2433208 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content38% 
IMG OID641236602 
Productsulfatase 
Protein accessionYP_001501862 
Protein GI157961828 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.21816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGAGT CTTCAGAACC ACATCAGCCA AGGATTAGTC TGACTCGGGT CTACTTTAGA 
ACTCTGGGCA TTTTACTTGT ATTTACTCTG CTTTTATTTA ACTTTAAGTC CTCACTTCAA
ACCTACTCGG TAATGTCACT TGCCACGACA ATGAATAGCC TAGACAGCGA TTTTTTATCT
GCTCGCGGAT TAATTTTAGA CGTCAGCTAT TTTTTCGCAA TTCTATTACT GCTCCATATT
ATTTGGTCAG GCATAATAAC TATTTCAGCT CTAGCGCCTT GGTACCAAAA TAAAGATAAA
CATCAATCCG ATGTTTATTG GTTATTACTT ATATGCCTTC ATATTACTTG TTTGATTGCA
ATAAACTCTT ATTTATATCC AACGTCATTA GTTTCATACT TCAGACACAC GCTTCTTAGC
AATCCACTAA TTTGGGGAGG GATTACATTC TTTTTGATCT TAAGCTTTTA TCGTGGTTTA
CTGAGCTTAA TGTCTAAGGC TGTAGTCACT TCTACCTCGA TATTAGCAGC CATCGTTCTC
ATCTCCCCTT TCGCCTCAAC CAACAAAACG AGTACTGACA ACTCTGACAA ACCAAATATC
ATTATTTTAG GAATCGACGG ATTGAGGCCT GATCATTTGG AATACTTAGC CGCAGACCCG
AAAATCGCGC CAAACTTGAA CCGTCTACTC AATAAAATGA CCATATACTC AAATACATAC
ACCCCTCAGG GGCGAACTTA CGTTGCGTGG ATGAGCCTCC TCACAGGCCA GTATCCTGTC
AGTAATGGCG TTAGATTTAA TTTAGCCCCG CCAGAATTGG TTGATAAAAC ACTACCAATC
ATTGAGTTAT TAAAATCAAA AGGTTATCAC ACAACTTATG CTATAGATGA GCGTCGTTTC
AATCAAATAG ACTCAAGTTA TGGTTTCGAT AATACTGTTG GGCCCAAGGC TGGTGCAGCA
GATGCTTTTA TAACTGGTTT AGGCGACCTA CCTTATTTTA ATATTATGCT AATCCATCCC
TACTCAAGTA AAATACTCCC ATACTTATAC AATAATAGAG CCTATGGAAA AGCTTATTCT
CCAGTAACAT TTAATAAAAC CGTCATCGGA AGCCTACCCC CTGAAAAACC AAATTTTCTG
GCAATGCACT ATTGTCAATT ACATTGGCCT TATACGTCTA AAGACTTCAT ACCACAGCCA
TTAGACAGTT GGGATGGCAA TTATAATCAC TATATGTATA CACAAATGAT ACTCAGCGTT
GATAAGCAAG TTAATGACCT CTTTAGTCGC TTAAAACTAA AAGGCATGCT CAACAATGCA
ATTGTTTATA TCATTTCTGA TCACGGTGAA AGCTTTAAGC TAAGTGACCA CCAAGCAATA
AACACTCAAA ATTCAAGTTC AATGCCAACA ACTAAGTCGT GGGGGCACGG CACTAATATT
TTGGACCAAC AGCAAACTCA GGTTCTATTA GCTCGAGCAG ATTTCAGCGA TGGCAAGATA
ATTACTGCAG CAACAGTGAT GGATGGACTT TACTCTTTAG TCGATATTGT GCCGACATTA
TTAAGCTCAT TAAATGTTTC AACTGAATCA ATTGCAAGTC AACAAGCCAT TCAATTAGAT
GGAGAGGTTC TTCCCAAAAC TGCAGATGAT GTTTTACTCA ATAGATACGT TTATGTGGAG
TCTTCCGTCC CTGTTAAATC CATAAATAAG AGCTTTATCG ACAAAAAAGA TGTCATGTCA
GAGACTGCTT CAAATTACGA AGTGAGAGAT GATGGTAAAG CCTATATGCG TCCGCAGAAC
TACATAGAGC TTATCGCGAA AAAACAGCGT GCTATATATT TTCAACACTG GCAATTAAGC
CTATTACCAG ACTTTGACTC CCCAATACTT TTAAATACTA AAACAAACGA AATTTATGTC
GCTGATGAGT ATCATGGTAA TTTTAACTGG CGACCTATGC TTAATGCGCT ATGTAATCGA
TACAAGGGAG ATCCGGGGTT TGATCCTGGC TCTTTTTGCA ATCAGATAGA TGTCGTAGTC
ACTCATTAA
 
Protein sequence
MIESSEPHQP RISLTRVYFR TLGILLVFTL LLFNFKSSLQ TYSVMSLATT MNSLDSDFLS 
ARGLILDVSY FFAILLLLHI IWSGIITISA LAPWYQNKDK HQSDVYWLLL ICLHITCLIA
INSYLYPTSL VSYFRHTLLS NPLIWGGITF FLILSFYRGL LSLMSKAVVT STSILAAIVL
ISPFASTNKT STDNSDKPNI IILGIDGLRP DHLEYLAADP KIAPNLNRLL NKMTIYSNTY
TPQGRTYVAW MSLLTGQYPV SNGVRFNLAP PELVDKTLPI IELLKSKGYH TTYAIDERRF
NQIDSSYGFD NTVGPKAGAA DAFITGLGDL PYFNIMLIHP YSSKILPYLY NNRAYGKAYS
PVTFNKTVIG SLPPEKPNFL AMHYCQLHWP YTSKDFIPQP LDSWDGNYNH YMYTQMILSV
DKQVNDLFSR LKLKGMLNNA IVYIISDHGE SFKLSDHQAI NTQNSSSMPT TKSWGHGTNI
LDQQQTQVLL ARADFSDGKI ITAATVMDGL YSLVDIVPTL LSSLNVSTES IASQQAIQLD
GEVLPKTADD VLLNRYVYVE SSVPVKSINK SFIDKKDVMS ETASNYEVRD DGKAYMRPQN
YIELIAKKQR AIYFQHWQLS LLPDFDSPIL LNTKTNEIYV ADEYHGNFNW RPMLNALCNR
YKGDPGFDPG SFCNQIDVVV TH