Gene Shewana3_4012 details


Gene Information

Locus tag: Shewana3_4012
Symbol:
ID: 4480226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 4815278
End bp: 4817242
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 52%
IMG OID: 639728627
Product: sulfatase
Protein accession: YP_871635
Protein GI: 117922443
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
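
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4815278
end_bp = 4817242
protein_length_aa = 654

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so a CDS of N codons
# encodes N - 1 amino acids.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("Gene length (bp):", gene_length_bp)
print("Length divisible by 3:", gene_length_bp % 3 == 0)
print("Expected protein length (aa):", expected_protein_aa)
print("Matches listed protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length of 1965 bp and protein length of 654 aa.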


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTCAG GTTTATCCGC TCGTGGACGT CAGCCCGCCC ATGGGCCGTT TCGCGTCATT 
CTTATTTTCA GTCTGTTAGT ACTCGCTATT GCTACGGCTA GCCGCATTGG CTTAGGCCTG
TGGCAGGGCG AACGCGTGGC CGCCGTTGAC GGTTGGTCGC ATCTGTTACT GCAAGGTATT
CGTGTCGATA TCGCAACCCT GTGTTGGTTA TGGGGCGTCG CTGCCTTAGG TACGGCGTTG
TTTTCAGGCG ATCATCTGCT AGGCCGCATT TGGCAAGGGG TGCTGCGCCT ATGGTTAACC
TTAGGTCTGT GGGCCATTGT ATTCCTTGAG GTTTCAACGC CAGCCTTTAT TGAAGAATAC
GGCATTCGCC CGAACCGCTT GTATGTGGAG TATTTGATTT ACCCCAAAGA AGTGCTGTCG
ATGCTGTGGG CGGGACGCAA GCTGGAGCTG ATTTTCTCTG TGCTGCTCAC CCTCGCGACG
CTGTGGGGCG GTTGGACACT CAGTGGTAAG TTGACTAAGA ATCTGCGTTT CCCACGTTGG
TATTGGCGTC CTGTGCTCGC TGTCATGGTG ATAGTGGTCA CTCTATTAGG CGCGCGTTCA
ACCTTAGGCC ATCGTCCAAT CAACCCCGCG ATGGTGGCAT TTGCCGACGA TCCATTAGTG
AACTCCCTCG TGATTAACTC CGCTTATTCG TTAGTGTTTG CGATTAAGCA AATGGGCAGC
GAAGAAGACG CCTCTAAGGT GTATGGCTCA TTGGATAGAG ATGAGATCAT CAACACAATC
AGGCAAGAGA GTGGCCGCCC TGAGAGTGCC TTTACTTCGA GCGATATCCC CTCATTAAGC
TTTAACCAAG CCAGTTACAA CGGTAAACCG AAAAATCTTG TGATCCTGTT GCAGGAAAGC
TTAGGCGCCC GTTTTGTGGG GAGTTTAGGC GGTATGCCTT TAACGCCGAA TATCGATGCG
CTCTCACAGG AAGGTTGGTA TTTCGATAAT CTTTACGCCA CTGGCACACG TTCAGTGCGC
GGAATCGAAG CGGTGACGAC AGGTTTTACC CCAACGCCGG CCCGCGCCGT GGTTAAACTC
GGTAAGAGTC AGACGGGCTT TTTCACCCTC GCCGAATTGC TGAAAAACCA CGGCTATACC
ACCCCATTTA TCTACGGCGG TGAGAGTCAT TTCGACAATA TGCGCAGCTT CTTTCTCGGT
AATGGATTTA GCGACATTAT CGACCAGAAG GATTATCAGT CCCCGGCCTT TGTTGGCTCC
TGGGGCGCAT CCGACGAAGA TTTAATGCGT AAGGCGAACA GTGAGTTTGA GCGTCTTCAC
AGTGAAGGTA AGTCATTCTT TAGCTTAGTC TTTAGCTCGA GTAACCACGA TCCATTCGAA
TTCCCCGATG GCCGTATTGA GCTGTATGAA CAACCTAAGC AGACCCGCAA TAACGCGGCT
AAATATGCCG ATTATGCAAT TGGTGAGTTC TTCAAACTGG CGAAAAATGC TGACTACTGG
AAGGACACTG TCTTCATCGT AGTCGCCGAC CACGACAGCC GCGTTGGTGG GGCCGATCTG
GTGCCTGTAT CGCGCTTCCG TATTCCGGGG CTAATTATTG GGGATAATGT TGCGCCAAAA
CGCGATCACC GTGTCGTGAG TCAAATCGAC TTGCCGCCAA CTCTGTTATC TTTGATTGGC
ATTTCAGACT CTTACCCTAT GCTAGGCCGT GACTTAACTC AGGTTAGCGA TGATTGGCCG
GGGCGTGCGC TGATGCAATA CGATAAAAAC TTTGCCCTGA TGGAAGGTAA AGATGTGGTT
ATCCTGCAGC CAGAAAAAGC GGCGCAGGGC TTCCAGTATG ATGAAAAGAC TGAGCACTTA
ACGCCTTATG CCCCAGCGGC GCAGGCGCTG GAGAAAAAGG CCTTAGGTTG GGCCCTGTGG
GGCAGCCTAG CCTACCAGCA AGAGCTGTAT CGCTCGGGTA AATAA
 
Protein sequence
MQSGLSARGR QPAHGPFRVI LIFSLLVLAI ATASRIGLGL WQGERVAAVD GWSHLLLQGI 
RVDIATLCWL WGVAALGTAL FSGDHLLGRI WQGVLRLWLT LGLWAIVFLE VSTPAFIEEY
GIRPNRLYVE YLIYPKEVLS MLWAGRKLEL IFSVLLTLAT LWGGWTLSGK LTKNLRFPRW
YWRPVLAVMV IVVTLLGARS TLGHRPINPA MVAFADDPLV NSLVINSAYS LVFAIKQMGS
EEDASKVYGS LDRDEIINTI RQESGRPESA FTSSDIPSLS FNQASYNGKP KNLVILLQES
LGARFVGSLG GMPLTPNIDA LSQEGWYFDN LYATGTRSVR GIEAVTTGFT PTPARAVVKL
GKSQTGFFTL AELLKNHGYT TPFIYGGESH FDNMRSFFLG NGFSDIIDQK DYQSPAFVGS
WGASDEDLMR KANSEFERLH SEGKSFFSLV FSSSNHDPFE FPDGRIELYE QPKQTRNNAA
KYADYAIGEF FKLAKNADYW KDTVFIVVAD HDSRVGGADL VPVSRFRIPG LIIGDNVAPK
RDHRVVSQID LPPTLLSLIG ISDSYPMLGR DLTQVSDDWP GRALMQYDKN FALMEGKDVV
ILQPEKAAQG FQYDEKTEHL TPYAPAAQAL EKKALGWALW GSLAYQQELY RSGK
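
The sequences in this section are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input, so the result is not the whole-gene GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence, copied as printed (blocks of ten bases).
first_line = "ATGCAGTCAG GTTTATCCGC TCGTGGACGT CAGCCCGCCC ATGGGCCGTT TCGCGTCATT "

fragment = clean_sequence(first_line)
print("Fragment length:", len(fragment))
print("Starts with ATG:", fragment.startswith("ATG"))
print("GC fraction of fragment:", gc_fraction(fragment))
```

To check the GC content listed for the gene, pass the full gene sequence (all lines joined) to `clean_sequence` instead of the single line used here.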