Gene Spea_1584 details

Gene Information

Locus tag: Spea_1584
Symbol:
ID: 5661981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella pealeana ATCC 700345
Kingdom: Bacteria
Replicon accession: NC_009901
Strand:
Start bp: 1921380
End bp: 1923185
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 45%
IMG OID: 641236172
Product: arylsulfotransferase
Protein accession: YP_001501444
Protein GI: 157961410
COG category:
COG ID:
TIGRFAM ID:
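
The coordinate and length fields in this record are arithmetically related. The sketch below is an illustrative check, not part of the source record. It assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1921380
END_BP = 1923185


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1806, matches "Gene Length"
print(protein_length(length_bp))  # 601, matches "Protein Length"
```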


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTAAAC CCTCGTTAAT TCTATCGGCA ATAGCCCTAT CTCTTAGTCT TTGTAGCAGC 
CAAGTTTTTG CTTCTATCGA TGGCATGAAA CCTAAGCCTG TTGAAGGGGC TCCTTTAGGT
TATATCATTC ATAACCCATA TGAAAATGCG CCACTCACCG CTTTGGTGAC CCTAGCTGGT
CATACCATTT CTGGCGTTGA AGTGACTGTA CATGCCAATG ATGCTGATGG CGTGAGTTTG
ACATACCAAG TTGATGATAT GCGGGTGATG GACGAGGGTG GAGTACCCAT TTTTGGCCTT
TACCCTGCAT TTATGAACAC CTTTACCGTG AAGTGGACGG AAAATGGTGA ACAAAAGTCT
CATGATTATA AGATGCTAAC GCCTGATATC GATATGGGAT TTTCTGAAAG CCAGTGGGCG
AAAGCACCAC TTGTTGAAGT TGAGCATGTT GATGCAGACT TTAAAGACAG ACTGTACTTT
GTAAACTGGA CCAATGCCGA CGGCAAAGCC GCGCCACTGA TGCACAACAA TTCAGATGCA
CCAGGGGCAT TTTCTTGGGA TGGCAAACCG GGTTTCTTTA TTATCGATAC AGCGGGTGAT
ATTCGCTGGT ATATGAACCC TTACACGACT CATGATGCCA AAACCTATGA CAATGCGGGC
TATGCCATGG GGATGAACGT CACTAAAGAC GGCAACATGG TATGGGTACA AGGTCAGGGC
TGGAAGAAAA TGTCGATCAT GGGGCGGATG ATCTCTGAGC ATAGTCTTCC GGGTAACTTT
ATTGACGCCT CTCATGAGGG AATTGAAGGC GCGAACGGTA ATATATTTAT TCGTGCAGCA
GCCAAAGATT ACCGCACAGC AGATGGCCGT TTGGTCAATA CTATTCGCGA CCAAATTATC
GAAGTTGACA ACACAGGCAA GCTGGTGGAT TACTGGGACT TAAATACTAT TTTAGATCCT
ATGCGTGATG CGGCTCTGCT ATCGCTCGAT GCGGGCGCGG TTTGTTTAAA CATCAACCTT
GACGATGCCG GTCACCAAAC CACAGAAGAA GATTTAGCCA AAGCGCCTTA CGGTGATATA
CACGGTGTGG CAACTGGTCG TAACTGGGCC CATGTTAACT CTATTGAGTA TGATCCAACT
GACGACAGCA TCATTATCAG CTCACGCCAT CAATCGGCTG TAATTAAGAT CGGCCGCGAC
AAGCAAGTAA AGTGGATCTT AAGTGCCAGC AAAGGCTGGA GTGAGAAGTT TCAAGATAAG
CTACTTAAAC CTATTACAGC TGATGGCAAG CCAATTTGGT GTAACGAGAA GGGCGCATGT
CAGGACAAGG ATTTTGATTT TAGTTGGACT TCACACACTG CCTATTTAGT CCCAGAAAAA
GGCACATTAA CAGTGTTTGA TAACGGCGAT GGCCGTGATT TAGGTCAGCC GATGTTTGCT
AACGAAAAGT ACTCTCGCTC TGTAGAGTAC AAGATTGATG AGCAAAACAT GACGGTTCAG
CAAGTATGGG AATACGGTAA AGATGAGCTA GGTTATGCAG GTTACTCACC AGTGACTTCT
ATCGTTAAAT ACCAAGCAGA TAAAGACAGT ATGATGAGCT ACTTTGCTTC TGCAGGTTTA
TTTGGTTTAG GTGGTGGTTA CGGCAACCTG AAGATGGATG ACACTACAGG TAAGGTGCAA
TCGATTCTGG TTGAACACAG ATATGGCGAA ACAAAGCCAG CGGTGCGCAT CAATATCGAT
AGCCATGATA TGTTTGCTAC GGGCTATCGC GCTCAGGTTA TTCGCGCTAA TGAAATGTTG
AAATAA
 
Protein sequence
MSKPSLILSA IALSLSLCSS QVFASIDGMK PKPVEGAPLG YIIHNPYENA PLTALVTLAG 
HTISGVEVTV HANDADGVSL TYQVDDMRVM DEGGVPIFGL YPAFMNTFTV KWTENGEQKS
HDYKMLTPDI DMGFSESQWA KAPLVEVEHV DADFKDRLYF VNWTNADGKA APLMHNNSDA
PGAFSWDGKP GFFIIDTAGD IRWYMNPYTT HDAKTYDNAG YAMGMNVTKD GNMVWVQGQG
WKKMSIMGRM ISEHSLPGNF IDASHEGIEG ANGNIFIRAA AKDYRTADGR LVNTIRDQII
EVDNTGKLVD YWDLNTILDP MRDAALLSLD AGAVCLNINL DDAGHQTTEE DLAKAPYGDI
HGVATGRNWA HVNSIEYDPT DDSIIISSRH QSAVIKIGRD KQVKWILSAS KGWSEKFQDK
LLKPITADGK PIWCNEKGAC QDKDFDFSWT SHTAYLVPEK GTLTVFDNGD GRDLGQPMFA
NEKYSRSVEY KIDEQNMTVQ QVWEYGKDEL GYAGYSPVTS IVKYQADKDS MMSYFASAGL
FGLGGGYGNL KMDDTTGKVQ SILVEHRYGE TKPAVRINID SHDMFATGYR AQVIRANEML
K
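
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative and not part of the source record. It builds the codon table from the amino-acid string published for NCBI translation table 11, whose codon-to-residue assignments match the standard code, and it ignores the table's alternative start codons because this gene begins with ATG. The function name `translate` and the short test prefix are chosen for illustration.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, in the 10-base blocks used above.
prefix = "ATGTCTAAAC CCTCGTTAAT TCTATCGGCA"
print(translate(prefix))  # MSKPSLILSA, the first 10 residues of the protein
```

Passing the full gene sequence, with its spaces and line breaks left in place, to the same function should reproduce the protein sequence once the 10-residue spacing is removed.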