Gene Slin_6289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6289 
Symbol 
ID8730073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7629943 
End bp7631403 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content58% 
IMG OID 
Productsulfatase 
Protein accessionYP_003391047 
Protein GI284041117 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.235101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACA TACCTATCCG ACTATCACGC CTTGTGCTGA GTGCGATCAC ACTCGTTGGG 
CTTGGACTAT CCATTAGTGC GTGGGTGGAG AAACCCGCTC CGGCAACCCC GCCCAACGTT
GTTCTCTTCT TTATGGACGA CCTGGGATAC GGCGATCTAT CCGTGACTGG TGCGCTGGAC
TACACCACGC CGAATCTGGA TAAAATGGCG GCCGAAGGTA CCCGATTCAC CAACTTCCTG
GCAGCTCAGG CCGTGTGCAG TGCATCGCGG GCGGCTCTGC TCACGGGTTG CTACCCCAAC
CGGCTGGGTC TGTACGGTGC GCTTGGGCCC AACTCGCCCA TCGGCCTGAA CCCGAACGAA
GAAACGCTGG CCGAACTCCT GAAAGAGCGC GGGTATGCTA CCGGCATGTT CGGCAAATGG
CATCTGGGCG ATAACAAGCA GTTTCTGCCC ATGCAGCAGG GCTTCGATGA GTATTATGGC
GTACCGTACT CGCACGATAT GTGGCCGCTG CATCCGGCGC AGGCACAGGC CAAGTACCCA
CCCCTGCGGT GGATTGATGG CAACGAGCCG GGACCGGAAA TAAAAGATTT GAACGATGCC
GGAAAAATCA CCGGCACCAT TACCGAGAAG GCCGTTTCAT TCATTCGGAA TCACAAAAAG
AAACCCTTTT TCCTGTATGT GCCGCACCCG CTGCCGCACG TGCCGCTGGC GACATCGGCC
CGGTTTAAAG GACAAAGTGC CCGGGGTATT TTCGGGGATG TGCTGACTGA ACTGGACTGG
TCTGTCGGGC AGATCATGAA CGAGTTGAAG CAGCAGGGAC TGGATAAGAA TACCCTCGTA
ATTTTTATCA GCGATAACGG CCCCTGGCTT AACTACGGCG ACCATGCCGG TTCGTCGGGT
GGGTTCCGGG AAGGGAAAGG GACGTCCTTC GAAGGGGGCC ACCGGGTGCC CTGCCTGGTG
CGCTGGCCGG GTGTCGTACC CGCCGGTCGG GTGAGTAACA AACTGCTGAC CGCGCTGGAT
ATTCTGCCAA CGGTTGCCAA CGTCTGTGGT GCCCGACTAC CCAAACAACG AATTGACGGC
GTCGATTGGG TGGCGCTGTT AAAAGGCGAT AACTCAGTAA CGCCCCGCGA TAAGTTCTAC
TATTATTATC GGAAAAATAG CCTCGAAGCC GTGCGGCAGG GCGACTGGAA ACTTGTATTC
GCGCACCCCG GCCGGACGTA CGAAGGGTTT TTGCCGGGAC AGGGTGGCAA GCCCGGTCCC
AGCACCGAAA CACACGCGAT TGCTGCCGGA TTATATGATC TCCGTCGCGA CCCCGGTGAG
CGCTACGACG TTCGGGAGCA GCATCCGGAG GTTGTAGCCC GACTCGAAAC GATTGCTGAA
GAAGCCCGCG CTGATCTGGG CGATGAATTA CAGAAGCGTA CGGGTGCCAA CGTGCGCGAG
CCGGGAAGGG TTAGTCAGTA A
 
Protein sequence
MKHIPIRLSR LVLSAITLVG LGLSISAWVE KPAPATPPNV VLFFMDDLGY GDLSVTGALD 
YTTPNLDKMA AEGTRFTNFL AAQAVCSASR AALLTGCYPN RLGLYGALGP NSPIGLNPNE
ETLAELLKER GYATGMFGKW HLGDNKQFLP MQQGFDEYYG VPYSHDMWPL HPAQAQAKYP
PLRWIDGNEP GPEIKDLNDA GKITGTITEK AVSFIRNHKK KPFFLYVPHP LPHVPLATSA
RFKGQSARGI FGDVLTELDW SVGQIMNELK QQGLDKNTLV IFISDNGPWL NYGDHAGSSG
GFREGKGTSF EGGHRVPCLV RWPGVVPAGR VSNKLLTALD ILPTVANVCG ARLPKQRIDG
VDWVALLKGD NSVTPRDKFY YYYRKNSLEA VRQGDWKLVF AHPGRTYEGF LPGQGGKPGP
STETHAIAAG LYDLRRDPGE RYDVREQHPE VVARLETIAE EARADLGDEL QKRTGANVRE
PGRVSQ