Gene Slin_3587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3587 
Symbol 
ID8727340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4342181 
End bp4345444 
Gene Length3264 bp 
Protein Length1087 aa 
Translation table11 
GC content51% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003388393 
Protein GI284038463 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.137997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAATC AATACATACT GATGTTAGGC TTGTGGCTGG CTGGATTGAG CATGGCTACG 
GCACAAAACT ATCCGTCGCT GGAGCGATTT GGGAAAAATC GTGTGCAGTA CCGGAGCTTC
GAATGGAAAA TTATCCGAAC TGCCAACTTT GAAATCTATT ATTATCAGGA TGGCAACCAG
ATTGCTAACC TGACGGCGCA GTATGCCGAG TCGGAGTTTG ACCGCATAAC GGAGTTGCTT
GGCTATACCC CTTACAATCG GGTCAAAATA TTCCTCTTCA ACTCGCCAGA GGAGATGGCG
CAGAGTAATA TCGGCCTTCA GGGAGGGCTG AGCAGCCGCG AACAAAACCT GTCCAAATCC
CGCGTTGAAC TGGCCTTTAC CGGCGATCAG ATCAGTTTTC GGCAGCAGAT TATCCGCGAC
ATCTCCATGC TTTTCGTGTA CGACATGTTG TATGGCGGCA GTCTGAAAGA CGCTTTGCAA
AGTTCACTCC TGCTGACCCT TCCCGACTGG TTCATGCCCG GCATCGCGTC GTACATCGCT
CAGGGAAATA GTCTCGAACT GGACGATTAC ATGCGGGATG TATCCCTGAA CCGGCCGGTT
AAAAAACCGT CGCTGCTGTC TGGAGCGGAT GCCGAACGGG TTGGGCATTC CATCTGGAAT
TACATCGTTC AGCGGTACGG GCGCGATAAT GTCTCCAATA TTCTGAACCT CACCCGGATT
ATCCGCAATG AGCAGAATAG CATTTCGAGT ACGCTCGGCG TACCCTATAA CCGCTTTCTG
CGCGAATGGC GTGAATATTA TGCCGGTATG GCCAATGCTG TAAATCAGTC GTATCGAGCC
AGTACAGACG ATTTTCAGAT TAAAGTAGGT TCGGCAGACG ATAAGTCGTT GTTGATCAGC
CTGAAGCTTA GTCCTGATAA ACAGTTTATT GCTTACTCGC TTCTGCGCGA CGGGAAGTTT
AGCGTAGAGG TCGTTAACAC GGCCAATCGG AAACGGCATA CCGTACTGAC GGGCGGCTAT
CGGTTGGATG GGCAAATTAA CCGAACCAGC ACCCCCTTAC TGGCGTGGCA GCGGGATAAC
AATCTGCTTG TCGTGACCGA TGAACTGGGA AAAACAAATC TGTATCAGTA CAGCGATTTC
GAAAAGCGGC CCAAACGGCA ATTTAAACGG CAGGTGAATG GATTGTCGCA GGTTGTCTGG
ATGAATGCTT CCGAAGATGG CGGCAGCCTG ATTATGAGTG CCGACCGAAA AGGGCAGAAC
GACCTCTTCC TGTACAGCAT CAACCGGGGT TCCTACCAGC AATTGACCAA CGATCTGTAC
GACGATCTGT ATCCAGCTTT TGTAGGCCGT ACTGCCCGAC AGGTAGTTTT CAGCTCAAAC
CGCCGACAGG ATACGCTGGG CGTTGACAAA GGGTCTTACC GAACCATACG CGACCAGTTG
AGCCTGTTCT CTCATGAGGG AAGTGCCCGT GATCTGTCAC TTGTGCGATT GACGGATTCG
CTGGGTCAGG CCACGCAGCC CATTCCTGCC GGTGAAACAA CCGTTTATTT TTTGAACGAC
GTCAGCGGTA TTCGAAATTT ATACCGGCTC GAAACCGAAT CGAAAAATGT CAGCCAGTTA
ACGGCTTTTC CAGAAAGTAT TCGGCTGTAT GACCTTAAGC CGGGAAATGG CGGGTTTGTC
TATAGTAGTC TTAAAAACGG TGACGAGTAT ATAGGATTCC GTTCGCAGTT CAATCTGTCG
CAAACGGCAC AGGCGCCCCC TACCCAGCGA AGTGTAGCCA TCAATCGGTT AGCTGCTTCA
AGCACACCCA GAGCTGCTCA GGCTAAAACC GATACAGCCT CCGTACCAAG ACGAACTGCG
CCCGACTCAA GTACGGCGGG CCGCGTAGCG CCGGGTGCTC TGGCGTACAC GCCTAAACTG
GCGCTCGAAC CCGGCGAAGT AGACACGGAC AACTATCAGT TTGATCCCGA AGTAGTTAAA
GCCGCTGAGT TTCGGCAGCG CCGTTCAGTT GCGGGGGTAT CGCCCGGTTT AACTACGGCT
CCCCCCCGGA ACCGACGTCG CGAGAACATC ACGATCCGTG GACCGTTCGA CTATAAAGCA
ACATTCGGCG TAAACGAAGC ACCGTCTAAC TGGCGCGTAG ACCCTATAAA AGGGTTCGGA
TACGCACAGG AAGTAACACT GACCGACTTG CTGGAGAACC ACGTACTGCG TGCGGGTGGA
TTTATCAGTC TAACCAACAC CCTACGGAAT AGCGATTTGT TTGCGGAGTA CACGAACCTC
ACCCATCTTA TTGATTTCGG AGCCAGAGTC GACCGCCAGA CCCTATTTGT AGATGGATCG
GGAATTCTTC AAAAATATCG ATACAATAGG GTTGCCTTAT CGGCTTCTTA TCCTATCTCA
GTCAATAGCC GGTTTACTGT ATCACCATTT TATGCCATTA CCCGGTTAAT TGACCTGTCT
TCCTTCGCCG AACCAGACCG CGTATCCGAT TATGCCGGCT TACGGGGTGA ATTTGTATTC
GACAATACAA ACGTAAACGG GATGAACATG ATCGTGGGTA CAAAGGCTAA ACTTCGGTAT
GAAGAATATG CCGGGCTGCG GGGTAAATCA GAAGGGTTCC GACGGCTCTC GCTGGATTTA
CGCCACTACC AGCGCCTGCA CCGCGACTTA ATCCTGGCTA CTCGTTTTGC CTTTAGTCAG
TCGGGTGGGG CCGCCCCGAA GAAAAGTACA CTCGGCGGCA TGGAGAACTG GGTTGGCGGT
CAGAAAGAGC TAATTGCGTC AAACCCGCTG CTGGTTCCGA ATGCTACGCA AAACGAAATT
CCGTACGATT ATCGGGACGT TTTCTTCCTG GATTTTGCGG CTCCTCTACG TGGGTTTAAT
CAGGGTAAAC TGACGGGCAA CAGTTATATG CTGTTCAACG CCGAACTTCG TTTGCCGTTG
GTTCGCTATT TATACAGAGG CAATATCACG TCTAACTTCC TCCGGAATTT ACAACTGGTA
GCCTTTACGG ATATTGGCAC GGCCTGGACC GGAAGTGGTC CATTCAGCCA GCAGAATAGC
CTTAATACGG AGGTCGTTGG TGGCGGGAAC ATACCATTCA GAGCGACAGT TACAAACTTT
AAGAATCCGT TTCTTATTGG CTATGGAGCC GGTGTTCGGA CTATGATTTT CGGCTACTTT
GTGAAGTTTG ACTACGCCTG GGGTCTTGAA GATAAAACGG TGGGCAAGCC GATACCTTAC
CTGACACTCG GTTACGATTT TTAA
 
Protein sequence
MRNQYILMLG LWLAGLSMAT AQNYPSLERF GKNRVQYRSF EWKIIRTANF EIYYYQDGNQ 
IANLTAQYAE SEFDRITELL GYTPYNRVKI FLFNSPEEMA QSNIGLQGGL SSREQNLSKS
RVELAFTGDQ ISFRQQIIRD ISMLFVYDML YGGSLKDALQ SSLLLTLPDW FMPGIASYIA
QGNSLELDDY MRDVSLNRPV KKPSLLSGAD AERVGHSIWN YIVQRYGRDN VSNILNLTRI
IRNEQNSISS TLGVPYNRFL REWREYYAGM ANAVNQSYRA STDDFQIKVG SADDKSLLIS
LKLSPDKQFI AYSLLRDGKF SVEVVNTANR KRHTVLTGGY RLDGQINRTS TPLLAWQRDN
NLLVVTDELG KTNLYQYSDF EKRPKRQFKR QVNGLSQVVW MNASEDGGSL IMSADRKGQN
DLFLYSINRG SYQQLTNDLY DDLYPAFVGR TARQVVFSSN RRQDTLGVDK GSYRTIRDQL
SLFSHEGSAR DLSLVRLTDS LGQATQPIPA GETTVYFLND VSGIRNLYRL ETESKNVSQL
TAFPESIRLY DLKPGNGGFV YSSLKNGDEY IGFRSQFNLS QTAQAPPTQR SVAINRLAAS
STPRAAQAKT DTASVPRRTA PDSSTAGRVA PGALAYTPKL ALEPGEVDTD NYQFDPEVVK
AAEFRQRRSV AGVSPGLTTA PPRNRRRENI TIRGPFDYKA TFGVNEAPSN WRVDPIKGFG
YAQEVTLTDL LENHVLRAGG FISLTNTLRN SDLFAEYTNL THLIDFGARV DRQTLFVDGS
GILQKYRYNR VALSASYPIS VNSRFTVSPF YAITRLIDLS SFAEPDRVSD YAGLRGEFVF
DNTNVNGMNM IVGTKAKLRY EEYAGLRGKS EGFRRLSLDL RHYQRLHRDL ILATRFAFSQ
SGGAAPKKST LGGMENWVGG QKELIASNPL LVPNATQNEI PYDYRDVFFL DFAAPLRGFN
QGKLTGNSYM LFNAELRLPL VRYLYRGNIT SNFLRNLQLV AFTDIGTAWT GSGPFSQQNS
LNTEVVGGGN IPFRATVTNF KNPFLIGYGA GVRTMIFGYF VKFDYAWGLE DKTVGKPIPY
LTLGYDF