Gene Slin_3375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3375 
Symbol 
ID8727128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4076680 
End bp4078395 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content54% 
IMG OID 
Productsulfatase 
Protein accessionYP_003388182 
Protein GI284038252 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.590124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG GTTTATTCGT ACTGGTAGTA CCGTTGATGG CACTGGTATC TATACTGATA 
CTGGGCGGAT TCGAGTGGAA ACAAGCAACA TCGCCAAGCC ATAAACAAGC ACCCAGACCC
AACATTATTG TCATCATGGC CGATGATATG GGCTACTCAG ATCTGGGCTG CTACGGGGGC
GAGATCCACA CACCAAATAT TGACTACCTG GCGAACAACG GCATTCGCTA CACGCAATTT
TACAATACAT CGCGCTGCTG TCCAACCCGG GCGTCGTTGC TTACCGGCCT CTACAATCAT
CAGGCGGGTA TCGGCAAAAT GACGGATGCC GAAGACGAGC CGGGGTATCG CGGCCATTTG
ACGGAGAACA CTGTTACGCT GGCCGAAGTC CTCAAATCGG CAGGCTATCA GACGGGTATG
ACCGGTAAGT GGCACGTTTC CAATACCAAT GTGCAAAAGA ATCCGCAGGA ACAGCTCGAC
TGGCTGAACC ATAAGAAAGA CTATGGCGAT TTTGCGCCTA TCAGCCAGTA CCCAACCAGC
CGGGGGTTCG ATAAATACTT TGGTAACATC TGGGGTGTGG TCGACTTCTT CGACCCGTTC
AGTCTGGTGA GCGGTACCAA ACCGGTTAAG GAGGTGCCGA AGAACTATTA CCATACCGAC
GCCATTAGCG ATACGACGGT GGCCTACATT AAATCCTTCG CCAAAACATC GTCGCCATTT
TTTATCTACG TGGCCGAAAC CGCCCCGCAC TGGCCCCTGA TGGCCTTGCC TGAAGATATT
GCGAAGTACA AGGATACATA CAAACCCGGT TGGGAAGCTA TTCGGAAAGC CCGCTACCGG
AAAATGAGCA AGCTGGGGTT GATCGATTCG ACCAAAACGA AGCTCTCCAA ACGCTGGCAG
GATAATCTGA CCTGGGCCAA CAACCCCGAT AAGGATTGGG ATGCCCGGGC AATGGCCGTT
CATGCCGCCA TGATCGACCG GATGGACCAG GGAATCGGTC GTATGATCAA GACCCTGCGA
GAAACGGGAC AGTTGGATAA TACGCTAATC CTGTTTTTGT CCGACAATGG GGCCAGCCCG
GAGAACTGTG CGGCCTACGG TCCCGGCTTC GACCGCCCCA ACGAAACCCG CGATGGCCGT
AAAATCGTGT ACGACTTGAA AAAACAGGTT CTACCCGGTG CCCAAACATC CTACGCATCC
ATTGGGCAGC GGTGGGCCAA TGTGGCCAAC ACGCCTTATG CCTTCTGGAA AGCAGAATCG
TATGAAGGCG GCATTCGTAC CCCGCTGGTT GCCTTCTGGC CAAAGGGAAT AACAGCCCAA
AAAGGCAGTT ACAGTACGCA GGTAGGGCAC GTGATGGATT TTATGAAGAC GTTCCTCGAC
CTGACCGGCG CTGCGTATCC CGCCACGTTT AAGGGACACA CCATTACCCC AACAACGGGC
GTCAGCCTAC TGCCTTCCTT CAGTGGAAAG GCCTCCATTG GGCACGAGAC CTTGTTCAAC
GAGCATTTTG GGGCTCGCTA CGCCCGTTCG GGCAACTGGA AACTGGTGTC GTCGAGCCGA
GACAGCACCT GGAGTCTATT CAATCTGGCC ACCGATAAAT CGGAAACGCA GGATCTGGCA
GCCAGATACC CCGAAAAAGT TCGTCAGCTT CAGGGCTTAT GGCAGCAGTG GGCCAGTGCG
CATCAAGTAT TCCCGAAACC CGGCAGAAAG AACTAG
 
Protein sequence
MKKGLFVLVV PLMALVSILI LGGFEWKQAT SPSHKQAPRP NIIVIMADDM GYSDLGCYGG 
EIHTPNIDYL ANNGIRYTQF YNTSRCCPTR ASLLTGLYNH QAGIGKMTDA EDEPGYRGHL
TENTVTLAEV LKSAGYQTGM TGKWHVSNTN VQKNPQEQLD WLNHKKDYGD FAPISQYPTS
RGFDKYFGNI WGVVDFFDPF SLVSGTKPVK EVPKNYYHTD AISDTTVAYI KSFAKTSSPF
FIYVAETAPH WPLMALPEDI AKYKDTYKPG WEAIRKARYR KMSKLGLIDS TKTKLSKRWQ
DNLTWANNPD KDWDARAMAV HAAMIDRMDQ GIGRMIKTLR ETGQLDNTLI LFLSDNGASP
ENCAAYGPGF DRPNETRDGR KIVYDLKKQV LPGAQTSYAS IGQRWANVAN TPYAFWKAES
YEGGIRTPLV AFWPKGITAQ KGSYSTQVGH VMDFMKTFLD LTGAAYPATF KGHTITPTTG
VSLLPSFSGK ASIGHETLFN EHFGARYARS GNWKLVSSSR DSTWSLFNLA TDKSETQDLA
ARYPEKVRQL QGLWQQWASA HQVFPKPGRK N