Gene Slin_5057 details


Gene Information

Locus tag: Slin_5057
Symbol:
ID: 8728822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6163737
End bp: 6167237
Gene Length: 3501 bp
Protein Length: 1166 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: ASPIC/UnbV domain protein
Protein accession: YP_003389831
Protein GI: 284039901
COG category:
COG ID:
TIGRFAM ID:
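
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming 1-based inclusive coordinates and that the gene length includes the stop codon (neither is stated in the record). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 6163737
END_BP = 6167237


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, ASSUMING an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3501 1166
```

Under these assumptions the computed values agree with the listed Gene Length (3501 bp) and Protein Length (1166 aa).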


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.279918
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATCA GTTTACTAGT ACTGGCAATA GGCCTGATTG CCGGTTGCCG TTCAGAACAG 
CAACAACCGG CAGACCCGCT GTTCCGAAAA CTACCCGCCG AAGAAACCCA CGTTAATTTT
AGTAATACCA TTACCGACGA CAAAGACTTT AACGTCTTCA ATTATCGCAA TTTCTACAAT
GGCGGAGGAG TCGCTATCGG CGATGTAAAC AACGACGGCC TGTCAGACGT GTTGCTGATA
GCCAATATGG GCGACAATAA ACTGTACCTG AACCGGACAG CGACGAGTAA AGCCGGGTCG
GAACAGCCTG GAATTCAGTT TGAAGATGTT ACCGCCAAAG CGGGTGTGGC CGGTAAACGG
GCCTGGAGTA CGGGCGCCAC CTTTGCCGAT GTAAACGGCG ATGGACGGCT GGATATCTAC
ATTTGCAATG CCGGTAACCG CGACGGCGAC GACCGGGCCA ATGAACTCTT CATCAACAAC
GGCAACGACG CCAGCGGTAT CCCCACGTTT ACCGAACGGG CAGCCGAATA TGGTCTCGAC
GACCGGGGTT ATTCCACCCA TGCCGCTTTC TTCGACTACG ACCGCGACGG CGATCTGGAT
ATGTACCTAC TCAACAACAG CTTCATGCCC GTTGGGAAAC TGCAATACGC TAACCTGCGG
TCGCAACGCG ATTCGCTGGG GGGGCATAAG TTGTTTGAAA ATAGGAGCCA GGGGTATTCT
TTGAGGAGCG AGGAGCGAGG AGCGAGAAGT GAGGAGCCGT CCGGTGGTGG TAGAAAAAAA
GGGAGTGGGG AGAAAAAAGG GAAGACTTCG GGGCCGGTGT TTGTGGATGT GTCGGAGGAG
GCTGGGATAT ATGGGAGTTT GATTGGCTTT GGGCTAGGGA TTACTATAGG TGATGTGAAC
GATGATAACT GGCTCGATAT TTATATTTCC AACGACTTCT ACGAGCGTGA CTATCTCTAT
ATCAACAACC ATGATGGCAC GTTTAAGGAG TCGATCAAGG AGTCGATGCC GCATACGAGT
CTGTCGTCCA TGGGGGCCGA TGTTGCCGAT GTCAACAACG ACGGCCGTCT CGATATTTTC
ATCACCGATA TGCTACCCGG CAACGACCGG CGGCTAAAAC GAACCTCCAG CTACGAAAGC
CACGACCTGG AGCAGATTAA AGTGGGCCGG GATTTTCATT ACCAGTTCAT GCAGAATATG
CTGCACCTGA ATCAGGGTAA TCAGCCGGGA AATGGTGGCT TGCCCGTTTT CAGCGACATC
GCCCGCTTTG CCGGGGTGCA GGCCACCGAC TGGAGTTGGG GGGCGCTGCT GTTCGACATG
GACAACGACG GCCAGAAAGA TATATTCGTA GCCAACGGCA TTGCTAAAGA CGTAACCGAT
CAGGATTTCG TGAACTTCCT GGCCGACCGC GAAAACATGG CGCAGATTGC CCGGCAACGG
GCCTTTAATT TTAAAGAGTT TCTGGACAAA GCTCCGTCGG AAGGCGTGCC GAATTACGCT
TTTCACAACG ATGGAAATTT ACAGTTTACC AATAAAGCGT TCGATTGGGG ACTGTCGGAG
CCCGATTTTT CCAACGGAGC TGCGTATGGT GATCTGGACA ATGATGGCGA TCTGGATTTG
ATTGTCAACA ACATGAATGC GCCGGTAGCC ATTTACCAGA ACCAATCCGT CGAGAAGAAT
AAGACTAATT ACCTGAAGAT AAAGCTGGAA GGCACCGGGA TGAACCGAAA TGCCATTGGT
GCCCGCGTGT TTTTACACCA GCCCGGCCAG ACGCAGTTGC TTCAGCAGAT GCCGAACCGG
GGCTTTGAGT CGTCGGTCGA CCTGAACCTG CTTTTTGGAT TGGGAGCCAG CCCTACCATT
GATTCGCTGG TGGTTATCTG GCCGGACGAT AAAATGCAGA CGGTACGCAA GCCGAAAGCC
AACCAGATGC TGACGCTTCG GCAGCAGGAT GCCAACCAGC TCTGGAAACC TGATCCAACT
ACGCAAACGC CCGCTTTTCA GGACAATACG GGGACTTCTG GACTGAGCTA TCTGCATCAG
GAAAGCGCCT TTGTCGACTA TAACCGGGAT GCTTTACTGA AGCAACAACT GTCGACCCAG
GGCCCGGCAC TGGCAACCGG CGACGTGAAC GGCGACGGGC TTGACGATGT ATTCTTTGGT
GGAGCCAGCG GCCACGCGGG CCACTTATTC GTGCAGCATA CCGCCGGTCG ATTTGTCGAT
AAAACGCCCG CTGTTATGCG GCAGGATACG ACTTATGAAG CGGTCGACGC TGTTTTGTTC
GACGCTGATG GCGATAAAGA CCTGGACCTG TACGTGGTAT CGGGAAGCAA CGAATTTGGC
GAAGAAACCG ACGAACTGCT GGATCGTTTT TACCTCAACG ATGGCAAGGG AAATTTCACC
CGCTCGGAAA CTGCGCTGCC CAATTTGAAA GCCAACGGCT CCTGCGTGGC GGCCGGTGAT
TTCGACCGGG ATGGCGACAT TGACCTATTC ATCGGCTCCC GAATGATTCC CGGCCAGTAC
GGTCGGGACC CGGCCAGTTA TTTACTTTCC AACGATGGCA CCGGGAATTT CAAGAACTAC
ACCAAACGCT CGCTGGCCGA AGTAGAGCAG CTGGGCATGG TAACCGATGC CACCTGGGCC
GACCTCAACG GCGATTCATA TCCCGAGCTG GTGGTGGTAA GCGATTGGGG ACCCGTGCGG
GTATTCGAAA ACAAGCGGGG GAAACTGAGT CAAAACCCCG AATTCACCAT CAAAAACGCG
CAGGGTGATG TCCTGAAAAC GAATGGCTGG TGGAACTGCG TAACCGCCGG TGACGCCGAC
GGCGATGGCG ATCTGGATCT GGTGATCGGC AATCTGGGCA CGAATTCACG CATAAAAGCC
AATCAGACAA TCCCCGCTGA ACTTTACACC GGCGATTTCG ACCATAACGG CACCGTTGAG
CAAATCATCA ACTGCGCCGA CGAAACGGGT GAGTTATACC CAATGGTGCT GAAGCAGGAG
TTGCAGAAAG CCATGCCGAC GATCAAAAAG AAATACGTGA AATACGTCGA CTACGCGGGC
AAGAAAATCA CCGATGTGCT GGATGAGGAT CAGCGGAAGC AGGCGGTTGT CAAACAGGCA
TTCATGGGCG AAACTGTCAT TCTGCTTAAC GATGGAAAGG GAAAACTAAC GTTGCAGCCT
TTACCGGCCG AGGCTCAGTT TTCGCCCGTA TGTGGTGCCC TCTTCACCGA TTATGATCAC
GATGGACACA CGGACCTGCT GCTGACCGGT AATTTTCTGG ATGTACTTCC CGAAATCGGG
CGCTACGACG CGAGTTATGG CGTTACCCTG CACAACAAGG GAAAGGCGGC CAACGGCACC
ATCCGGTACG AATCCGTCAA TCCGGCGGTA TCCGGCTTTT TCGTCCGTGG GCAGGTTCGG
CACATCGCCC AACTAAAACA GGGGCAAATT ATTCTCGCCA AAAATAACGA CAAGGCACAA
GTGTTTTCTT TAAAGAAGTG A
 
Protein sequence
MRISLLVLAI GLIAGCRSEQ QQPADPLFRK LPAEETHVNF SNTITDDKDF NVFNYRNFYN 
GGGVAIGDVN NDGLSDVLLI ANMGDNKLYL NRTATSKAGS EQPGIQFEDV TAKAGVAGKR
AWSTGATFAD VNGDGRLDIY ICNAGNRDGD DRANELFINN GNDASGIPTF TERAAEYGLD
DRGYSTHAAF FDYDRDGDLD MYLLNNSFMP VGKLQYANLR SQRDSLGGHK LFENRSQGYS
LRSEERGARS EEPSGGGRKK GSGEKKGKTS GPVFVDVSEE AGIYGSLIGF GLGITIGDVN
DDNWLDIYIS NDFYERDYLY INNHDGTFKE SIKESMPHTS LSSMGADVAD VNNDGRLDIF
ITDMLPGNDR RLKRTSSYES HDLEQIKVGR DFHYQFMQNM LHLNQGNQPG NGGLPVFSDI
ARFAGVQATD WSWGALLFDM DNDGQKDIFV ANGIAKDVTD QDFVNFLADR ENMAQIARQR
AFNFKEFLDK APSEGVPNYA FHNDGNLQFT NKAFDWGLSE PDFSNGAAYG DLDNDGDLDL
IVNNMNAPVA IYQNQSVEKN KTNYLKIKLE GTGMNRNAIG ARVFLHQPGQ TQLLQQMPNR
GFESSVDLNL LFGLGASPTI DSLVVIWPDD KMQTVRKPKA NQMLTLRQQD ANQLWKPDPT
TQTPAFQDNT GTSGLSYLHQ ESAFVDYNRD ALLKQQLSTQ GPALATGDVN GDGLDDVFFG
GASGHAGHLF VQHTAGRFVD KTPAVMRQDT TYEAVDAVLF DADGDKDLDL YVVSGSNEFG
EETDELLDRF YLNDGKGNFT RSETALPNLK ANGSCVAAGD FDRDGDIDLF IGSRMIPGQY
GRDPASYLLS NDGTGNFKNY TKRSLAEVEQ LGMVTDATWA DLNGDSYPEL VVVSDWGPVR
VFENKRGKLS QNPEFTIKNA QGDVLKTNGW WNCVTAGDAD GDGDLDLVIG NLGTNSRIKA
NQTIPAELYT GDFDHNGTVE QIINCADETG ELYPMVLKQE LQKAMPTIKK KYVKYVDYAG
KKITDVLDED QRKQAVVKQA FMGETVILLN DGKGKLTLQP LPAEAQFSPV CGALFTDYDH
DGHTDLLLTG NFLDVLPEIG RYDASYGVTL HNKGKAANGT IRYESVNPAV SGFFVRGQVR
HIAQLKQGQI ILAKNNDKAQ VFSLKK
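
The gene and protein sequences above can be cross-checked against each other. The following is a minimal Python sketch, assuming the sequence is pasted as text with the spaces and line breaks shown above. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids as the standard code; its alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGCGCATCA GTTTACTAGT ACTGGCAATA"))  # MRISLLVLAI
```

Applied to the full gene sequence, `translate` can be compared with the listed protein sequence and `gc_percent` with the listed GC content.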