Gene Slin_5096 details


Gene Information

Locus tag: Slin_5096
Symbol:
ID: 8728862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6231575
End bp: 6234544
Gene Length: 2970 bp
Protein Length: 989 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003389870
Protein GI: 284039940
COG category:
COG ID:
TIGRFAM ID:
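The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function name `check_cds_lengths` is hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return simple consistency checks for a CDS record (hypothetical helper)."""
    # Assumption: coordinates are 1-based and inclusive.
    span = end_bp - start_bp + 1
    codons = gene_length_bp // 3
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        # Assumption: the last codon is a stop codon, excluded from the protein.
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record above.
checks = check_cds_lengths(6231575, 6234544, 2970, 989)
print(checks)
```

Under these assumptions all three checks hold for this record: 6234544 - 6231575 + 1 = 2970, and 2970 / 3 - 1 = 989.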


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATACT TTTCCGTAGC CATTGGTGCG GCATTTATTT TATTGCTTCC CCTTTTTGTT 
CTAGCTCAGG TTCAGGTGTC TTTTCCGACG ACACGGGCGG TTTTGCAACG AAACAATTCA
AATCAGGCAA CTATTCGTAT AACAGGGTAT TATACCGCAA CAGTCGGGCG TGTTGAAGCC
CGTTTACAGG CAAGGGATGG TATAGGCTCC TCAACCGATT GGGTAACACT TCAAAATAAT
CCATCGGGAG GGGTATTCAG CGGTGATATA ACGGGCTCAG GTGGCTGGTA CAATCTCGAA
GTACGGGGTA TGAACGGCGA CCAGCAAGTG GGTAATTCAA CAACTGTAGA GCGCGTTGGC
ATTGGCGAAG TATTTGTCGT AGCCGGACAG TCTAATGCAC AGGGTATCCA TCAGGATGCA
CCCAATCCAC TGAATGACCT GGTAAACTGC GTTAACTACC GTTACCCAGA CCAAGGCTTT
CCGAACGAAC CACCCACCCC CGTATTCACC CAACTCGATA ATTCATCGGG TTTTACAATA
GCGCCCAGAG GAATGGGTAG CTGGGCATGG GGGCAGTTGG GCGATATTCT GGCGAAAAGG
TTACGTGTTC CTATTCTATT TTTTAACGCA GCTTTTACAG GAACGTTTGT ACGTAACTGG
CGTGATAGTG CGCCCGAAGG TGGAGTGGCT TATGGCCCTG GCGGGGCCTA CCCTGCCCGT
CAGCCGTACA TTAACTTAAA ACTTGCCCTT CAGTTTTATG CCAATTCCCT CGGCGTGCGC
GCTGTACTTT GGCAGCAGGG CGAATCCGAT AACCTTTACA ATACGTCAAA AGATCAGTAT
GTCAACGATC TTCAGTACGT TATCAATCAG TCTCGGCAGG AATACAACAG TAACACGTCG
TGGGTAGTAG CCCGTGTGAG TTACGGCGAC TTTACCGGTG GAGTTGACCC AGCTATCATT
GACGCTCAGA ATCAGGTTAT CAGCACAACC GCCAATGTAT TTGCCGGGCC TAATACAGAT
GTGATTCAAA TACCACGCCA ACGGCCGCCA CGCAATGATC CGGAGGGTGT TCACTTCGAT
TATAATGGCC TTGTCGATTT GGCAAACGCC TGGAACGCGA GTTTAAACGA TTCTTTTTTT
CAGCGTTCTA CACCTATATC ACCAGTTGCC TCGCCAACAA TCTCTATCGC TTGTGCTTCT
AACAACAACC TTTCTCTTAC CGTAAATGGT AACTACGCCA GTGTGCAATG GGAGTCGGGA
GAATCAGGTA ATAGTATTAC GAAGGGAGCA GGTGTATATC GTGCTAAGGT CAAAGATTCA
CGGGGAAATA CGCTTTTCAC CAATCAGGTG CGGGTATCTG ATGCACCAAT TGCGGCAACG
AGCGATAATA GGCCGCCCTC TGTTTGTATA GGCAGTAGCC TGGCACTAAC AACGAATTAT
GACAATGTAA CCTGGCTAAA CCAGCAGAAT AACACAACGG TAGCAACGTC ACGCAACTTC
TCGACTGTTT CGGCCGGTGC CTACTACGTT CGCTATCGGG ATGTGAGCGG CTGTGAGTTT
ACATCGAATG TATTGAACGT AACGGTAAAC CCATTACCCG ACACGCCTAC AATTACCAAC
GACAAACCAA CGGTATTCTG CCAAGGGGAT AACACGACGC TTCGTGCCAA TGTTGATAAC
ATCCAGTACA ACTGGAGCGA TGGCCAAAAG AATAAGGTGG TGAACGTTGG CAACTCGGGT
TCTTACTTCC TGACGGTAAC GGATCGGAAT GGTTGTACAT CAGCGCAGTC AAACACAATT
GCAGTTACTG CGAACCCCGT ACCCGCTAAA CCTACCATCG CCACGAACGG CCCAACCACC
TTCTGCGCAG ACAGAACCAT TACCCTAACT GCTCCCCAAA ATGTCGCTTA TCAGTGGACA
AGCGGTCAAA CAACCCAGAG CATCACCCTT AGCCAGTCCG GTAATTTTGC GGTTAAGACC
AGTAACCAGT TCGGATGCAC ATCTGAGCAG TCAGATGTGT TGACGATTCA GGTCAATCCT
CTCCCACAAA CCCCATCTAT TACAGCTGGC GGTGCAACAA CGTTTTGCGA AGGCAATCGT
GTTACGTTAA GTGCAAGCAG CAACAACACG ATTGTATGGT CCAGTGGCCA GCGCAGCAAC
AGCATTACTG TTAGTACGTC TGGCAATTTT ACCGTTCAGG CACTTGACCA AAATGGCTGT
TTATCGCCCT TTTCACCGGT TATAGCCGTG AAAGTAAACC CTCTGCCTGC AACGCCAACC
ATACTTGCGG CTCCTTCTCC TATCATTTGT GAAGGAGATA GAGCCACCTT ACGGGTTGAC
GGTCCATATA CTGTTTTTTG GAGCACCGGC GATTCTACCC AGCGCATTAT GACCGGTTCA
GCGGGCAATT ACTCCGCCAA AATCCGGGAT GTTAATGGCT GTGTTTCTGC TCAGGCAGGA
GCCATAACGG TTGAATTAAG ACCACTTCCC CCTTCTCCTA CCATTAATGT CATTGGTACC
TACACCCTTC AGGCGATAAG CTCAACGAAT GGCACCGTAT TCCGCTGGCG GGTGGGTACT
GATTCGCTAG CGGCACAAAC GGCCATTATT AAAGCAAATC AATCTGGTTC CTATACGGCG
CGCGCGTCAA TCGTCTACTC ACAAGCACTA ACCTGCTTCT CGTTACCATC GGCTCCATTC
GCTTTTACGG TCGATGTAAG CAATAAGGGA TTAAGTGTTT ACCCGAATCC TAATCCGGCT
AAAATTATCA CAATAGAAAC ACTGGCTAAC CTGACAAATG CCGTTATCAC CATTTATACC
ATCAATGGTC AGATAGTCTT CACTACACCG GTTCCCTCCC TGGATGAGCG AAAACAATTG
GTTTTAACCA GTTTGACCTC AGGCTCTTAC ATTTTACGTG TACAATCGGC TGATTTTGAC
GTTTCAAAGC GAATTATACT CGGATTGTAA
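The gene sequence is displayed in blocks of ten bases, so it needs whitespace removed before use. The sketch below shows one way to do that, to translate in-frame codons, and to compute a GC fraction. The helper names (`clean_sequence`, `translate`, `gc_fraction`) are hypothetical. The codon table is written out in the compact TCAG ordering of the standard genetic code, on the assumption that translation table 11 uses the same amino acid assignments for elongation and differs only in permitted start codons.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base); "*" is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in-frame codons, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above; both start in frame.
first_line = "ATGAGATACT TTTCCGTAGC CATTGGTGCG GCATTTATTT TATTGCTTCC CCTTTTTGTT"
last_line = "GTTTCAAAGC GAATTATACT CGGATTGTAA"
print(translate(clean_sequence(first_line)))  # MRYFSVAIGAAFILLLPLFV
print(translate(clean_sequence(last_line)))   # VSKRIILGL
```

The two printed fragments match the start and end of the protein sequence listed in this record. Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the reported GC content.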
 
Protein sequence
MRYFSVAIGA AFILLLPLFV LAQVQVSFPT TRAVLQRNNS NQATIRITGY YTATVGRVEA 
RLQARDGIGS STDWVTLQNN PSGGVFSGDI TGSGGWYNLE VRGMNGDQQV GNSTTVERVG
IGEVFVVAGQ SNAQGIHQDA PNPLNDLVNC VNYRYPDQGF PNEPPTPVFT QLDNSSGFTI
APRGMGSWAW GQLGDILAKR LRVPILFFNA AFTGTFVRNW RDSAPEGGVA YGPGGAYPAR
QPYINLKLAL QFYANSLGVR AVLWQQGESD NLYNTSKDQY VNDLQYVINQ SRQEYNSNTS
WVVARVSYGD FTGGVDPAII DAQNQVISTT ANVFAGPNTD VIQIPRQRPP RNDPEGVHFD
YNGLVDLANA WNASLNDSFF QRSTPISPVA SPTISIACAS NNNLSLTVNG NYASVQWESG
ESGNSITKGA GVYRAKVKDS RGNTLFTNQV RVSDAPIAAT SDNRPPSVCI GSSLALTTNY
DNVTWLNQQN NTTVATSRNF STVSAGAYYV RYRDVSGCEF TSNVLNVTVN PLPDTPTITN
DKPTVFCQGD NTTLRANVDN IQYNWSDGQK NKVVNVGNSG SYFLTVTDRN GCTSAQSNTI
AVTANPVPAK PTIATNGPTT FCADRTITLT APQNVAYQWT SGQTTQSITL SQSGNFAVKT
SNQFGCTSEQ SDVLTIQVNP LPQTPSITAG GATTFCEGNR VTLSASSNNT IVWSSGQRSN
SITVSTSGNF TVQALDQNGC LSPFSPVIAV KVNPLPATPT ILAAPSPIIC EGDRATLRVD
GPYTVFWSTG DSTQRIMTGS AGNYSAKIRD VNGCVSAQAG AITVELRPLP PSPTINVIGT
YTLQAISSTN GTVFRWRVGT DSLAAQTAII KANQSGSYTA RASIVYSQAL TCFSLPSAPF
AFTVDVSNKG LSVYPNPNPA KIITIETLAN LTNAVITIYT INGQIVFTTP VPSLDERKQL
VLTSLTSGSY ILRVQSADFD VSKRIILGL