Gene Slin_4996 details

Gene Information

Locus tag: Slin_4996
Symbol:
ID: 8728760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6084962
End bp: 6086482
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: sulfatase
Protein accession: YP_003389773
Protein GI: 284039843
COG category:
COG ID:
TIGRFAM ID:
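
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; neither assumption is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 6084962
end_bp = 6086482

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1521 bp
print(protein_length, "aa")  # 506 aa
```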


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.317023
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGGAT TTACTAAATG GAGTGTTGTC ACCGGCATCA TCGGCTGTGT GCTGATTTGT 
ATCGGCATGG CTTCCCGGCC CGCTCAGCAA CCACCCAACA TCGTTTACAT TCTGGCCGAT
GACTTAGGAT ATGGCGACGT GTCGGTCTAT AACCCGGCGG GAAAGATTGC CACGCCCAAT
ATTGACAAAC TGGCCGCGCA GGGTATGCGC TTTACCGATG CGCACTCGCC TTCGGGTGTG
TGTACGCCTA CCCGGTACTC CCTGCTGACG GGTCGTTATC CATGGCGTAG TCGTTTACCC
GTGGGCGTTT TGCGCGGCTA CAGCCGAACA TTGATCGAAG CGGATCGCCC GACGGTGGCC
TCATTGCTGA AAGGTAATGG GTATCAAACG GCGGTCATCG GCAAATGGCA TCTGGGTCTC
GACTGGGTTC CAAAAAAAGG CAGTGAGTCG TTGCTGGCGT CGGCGGAGTA TGGCATCCAA
TCGGAGATGG ACCCGGCGGT GATCGATTTC TCGCAGAATC CAGCGCATGG GCCTAATACA
ATAGGGTTCG ATTATTCGTA CGTGTTGCCC GCTTCGCTCG ATATGCCGCC TTACTGTTAC
CTGGAAAATC ATAAACTGAC CGAGTTACCC ACTGGCTACA CTAAAGGTAA TAAAATAGAG
TCGGGCTACG CGGGTCCTTT CTGGCGCGAA GGCAGTATGG CTCCTTCCTT CGATTTTCAT
GGCGTACTAC CCCGATTTGT TGAGGAAGCG GTTGGTTTTC TGAACCGACA AACGGCAAAA
AAACCATTCT TTCTGTATTT GCCACTGGCG GCCCCGCACA CGCCCTGGAT GCCGACTAAA
GACTATACGG GCAAATCGAA AGCGGGTGAG TACGGCGATT TCGTGCAGCA GGTCGATGCA
ACAGTGGGGG AGGTGTTGGC GGCTCTCGAA AAAACGGGAC TGGCTGGCAA TACACTCGTT
GTTTTTACCA GTGATAACGG ACCGTATTGG CGGGATGATT ACGTGAAGCG TTTCGACCAC
AGGGCCGCTG GCGGGTTCCG GGGGATGAAA GGCGATGCGT TCGAAGGGGG GCACCGCATT
CCGTTTATCG TCCGCTGGCC GGGTAAAGTG AAAGCTGGAA CGGTGAGCCA GGCCACCACA
ACGCTGGCTA ATCTGACCGC TACATGCAGG GAAATTCTGG GTAAGACTAA CCCCAACCAG
GATGATAGTT ACAGTATACT ATCGGTGCTT GCGGGGAAAA CCAGGGATGT ACCGAACCAA
CCGGCTGTCG TGCATAGTTC ATCAATCGGC TTTTTCGCCA TTCGGAAAGG AGATTGGAAA
CTAATCGAAG GGCTGGGGTC GGGCGGTTTT ACGGAACCCA AAGAAATTAA GCCTAAAGCA
GGAGAGCCCG TCGGGCAGTT GTACAACCTC GCCACCGATC AGCTGGAAAC CACCAACATG
TACCAGCAAC ATCCCGAAAA AGTAAAGGAA TTGACGGATT TGCTGGCGAA AATTAAAGAG
GGAAAAGAAC AGTATAAGTA G
 
Protein sequence
MPGFTKWSVV TGIIGCVLIC IGMASRPAQQ PPNIVYILAD DLGYGDVSVY NPAGKIATPN 
IDKLAAQGMR FTDAHSPSGV CTPTRYSLLT GRYPWRSRLP VGVLRGYSRT LIEADRPTVA
SLLKGNGYQT AVIGKWHLGL DWVPKKGSES LLASAEYGIQ SEMDPAVIDF SQNPAHGPNT
IGFDYSYVLP ASLDMPPYCY LENHKLTELP TGYTKGNKIE SGYAGPFWRE GSMAPSFDFH
GVLPRFVEEA VGFLNRQTAK KPFFLYLPLA APHTPWMPTK DYTGKSKAGE YGDFVQQVDA
TVGEVLAALE KTGLAGNTLV VFTSDNGPYW RDDYVKRFDH RAAGGFRGMK GDAFEGGHRI
PFIVRWPGKV KAGTVSQATT TLANLTATCR EILGKTNPNQ DDSYSILSVL AGKTRDVPNQ
PAVVHSSSIG FFAIRKGDWK LIEGLGSGGF TEPKEIKPKA GEPVGQLYNL ATDQLETTNM
YQQHPEKVKE LTDLLAKIKE GKEQYK
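
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following Python sketch shows one possible way to do that and to compute a GC fraction. The helper names (`clean_sequence`, `gc_fraction`) are hypothetical, and only the first and last lines of the gene sequence are used as sample input, so the printed GC value applies to that one line, not to the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Stop codons under translation table 11 (the bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCCGGAT TTACTAAATG GAGTGTTGTC ACCGGCATCA TCGGCTGTGT GCTGATTTGT"
last_line = "GGAAAAGAAC AGTATAAGTA G"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(head[:3])                  # first codon of the gene
print(tail[-3:] in STOP_CODONS)  # whether the gene ends in a stop codon
print(round(gc_fraction(head), 2))
```

To check the 53% GC content reported in the record, the full gene sequence would have to be passed through `clean_sequence` first.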