Gene Slin_2472 details


Gene Information

Locus tag: Slin_2472
Symbol:
ID: 8726216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2985590
End bp: 2989108
Gene Length: 3519 bp
Protein Length: 1172 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: ASPIC/UnbV domain protein
Protein accession: YP_003387290
Protein GI: 284037360
COG category:
COG ID:
TIGRFAM ID:
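
The lengths listed above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the unspliced CDS ends with a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 2985590
END_BP = 2989108


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3519
print(protein_length(length_bp))  # 1172
```

Both values agree with the Gene Length and Protein Length fields in the record.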


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0642293
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.355386
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGCA CCACCCGCGA CTGTAACTTT TTGACCATGT CAATTCTTAA AAAAACGGAT 
TTCGTGAAAA CGATTACGTA TCTACTTGCG CTAACCAGCC TCATTTCAAT AACCGGGTGT
AATCAGTCGA AGTCAAGCGA GGAGACAACG AATAAGGATG GCCCACCACT CTTCACCTCT
CTAACCCCCG AACAAACCGG TATTACCTTT GCCAATAACC TGACCGAAGG CCTCAACACA
AACGTGTTGA TGTACGAATA CTTCTACAAT GGGGGTGGTG TTGCAGTAGG TGACTTAAAC
GGTGATGGGC TTGAAGACAT TTACTTTAGC GGCAACATGG TTCCTAACCA GTTGTACATT
AACAAAGGGG GGATGAAATT TTCGGACGTC ACCGCCACAG CGGGCGTAGC CGGACGAGAA
GGCCCCTGGC GAACGGGCGT CTCTATGGCC GATGTAAACG GCGATGGTAA GCTGGATATT
TTTGTTTGTT ATTCTGGAAG TTTGCCACCG CATAAACGCA TTCCCCAACT GTTTATCAAC
GAAGGGGCAG ATGCGCAGGG AGTTCCCCAT TTTTCTGACC AGACTGCACA GTATGGGCTT
GATAAACCCG AACAAAGTAC GCAGGGGGCT TTTTTTGATT ACGATCTAGA TGGTGATCTC
GACCTGTTTC TGCTTAACCA TAATCCGCGG TTACTCCCCA TCCTCGACCC GGATGCAACG
GCGGCTATTA TGAAACAGCC TAATCCTGAA ATCGGCGTTC GGTTATTTAA GAATACGGGC
AATCGATTTG AGGATATCAC CGAAAAGTCG GGACTGAGCA GTTCGGTATT GACATATGGA
TTAGGTATCG GCGTATCGGA TATCAACGGG GATGGCTGGC CCGATTTGTA TATCTCCAAC
GATTATGGCG TTCCCGATTA TCTCTACATA AACAATCAGG GAGGACGGTC TGGCGGTCCG
GTGTTCACCA ATCACCTGAA AACCAGTATT GGGCATACCT CTAATTTTTC AATGGGTAAT
GCCGTGGTTG ACATTAATAA CGATTCCCGC CCAGACATAT TCACCCTCGA CATGCTGCCC
GAAGATAACC GGCGGCAGAA ACTGTTAATG GCTCCCGATA ATTACGAAAA ATTTGATCTG
AGTGTTCAGT CAGGGTTTCA TTACCAGCAT ATGCGTAATA TGCTCCAGAT CAATGAAGGA
GTAATTGGTT CGGGCCAGCA AACAAAGGGG AGTTCATCTG CGGCAGCCGC TCCACTTTTC
AGTGAGACAG GCCAGTTAGC CGGTATCTCC AACACCGATT GGAGCTGGTC GCCGTTGTTC
GCCGATTATG ATAATGACGG ATGGAAAGAC CTGTATGTTA CCAACGGCTA TGTTCGCGAT
TATACAAATC TGGACTTTCT TAAGTTCATG ACGGACTATA TGCAGAACCG GCCAGCCAAC
TTCCGTCGTG AAGATGTGCT CGAACTGGTT CACCGCATAC CCTCTTCCAA CGTTATGAAC
TACATGTTCC GCAATCGGGG CAGCGAGCCG GGGGATGAGG TTACGTTTGC CAATGTTGGA
GCCGACTGGG GATTAACGCA AACCTCAAAC AGCACTGGTG CCGCTTATGC TGATCTCGAC
AACGATGGCG ATCTGGACCT GATTGTCAAC AATACCAACC AACCGGCTTT TGTCTTTAAA
AACGAAGCAA ATACGGTGCG TAACCACCAT TACCTGGCTA TTCAATTGAC GGGCGGGCGG
AGCAACACCC AGGGCTTGGG CACAAAAGTA ACGCTGTATC GAAAAGGTAA ACAACAGTTT
CTGGAGCAGA TGCCTACACG TGGGTATCAG TCGAGTATGT CACCCCGTTT GCACTTTGGT
CTGGGAACCG ACGCGCTCAT CGATTCGCTG CGCATTGTGT GGCCAACGGG TAAACAGCAA
CTGCTGACGG GCGTTAAAGC CGATCAACTG CTGAAACTGG ACGAGAAAGA TGCCGTTACG
CTATTTAAAA ACCCGATAAC GGCACCAACC TTATTTCAGG AAGTGAAATC GCCGGTAGTC
TTTGTCGATC CGGCCAAGCG AATAAACGAT TTTAAACGAC AGCCCCTCCT GGTGAACGCC
CAGTCGTTCA GCGGGCCTTG CCTGGTTAAA GCCGATGTGA ATGGCGACGG TCGCGAGGAT
ATTTACGCCG GAGGAAGCAG TGGAGAGGCT GGAGCTTTAT TTATTCAGCA ACCTTCCGGG
CAGTTTAGCC GACTGACACA GCCTGCTTTC GAAGCAGATA AACTGAGCAA CAATGTTGAT
GCGGCTTTCT TCGATGCCAA TGCCGATGGT TTTCCTGACT TATATGTGTG CAGTGGCGGG
TATGCCGATC TGGAGATTAG CGACGATGTA CGGTTGCTGG ATCGCTTATA CCTGAACGAC
GGTAAGGGCA AATTTACCAA AAGCCCGAAC GCTCTACCTG CCATGCAAAC CAGCTCTGGT
TGTGTGCGGG TAGCTGACGT TAACGGCGAT GGTCGCCCGG ATGTATTCGT CGGCGGGCGC
GTTGTGCCGG GCCGTTACCC TGAAACACCC CGGAGTTACC TGCTCGTTAA TGCCGGGAAA
GGACCGGACG GCCAACCCAG GTTTACAGAT CAAACGGCTC AACTGGCTCC TATGCTCGCA
ACGATTGGTA TGGTTACCGA TGCCGCCTGG ACCGACCTCA ACGGTGACCG GAAACCTGAA
CTGGTTATCG TTGGTGAGTG GATGCCGATA ACCGTACTTG GCATGGAAAA AGGGGGCAAC
TCCACTCAAT TAACCGACCA GACAAAAACG TATTTCGAAA AAGAATACCG GGGCTGGTGG
AACAAGCTGC TGGTGAATGA TATAAATGGA GACGGCAAGC CCGACCTGGT GATCGGTAAT
CTGGGACTTA ATACGCAATG CCACGCCAGC GATAAAGAAC CCGCCGAGCT TATTTACAAG
GACTTCGACA ATAATGGTAA AGTAGACCCA ATTCTTTGTT TCTACATACA GGGCAAAAGC
TACCCCCACG CTACCCGCGA TGAACTACTG GATCAATTAG GGATACTTCG TCACCGGTTT
ACCAATTTCG AAAGTTATTC AAACGCTACC CTGACAGACG TATTCACGGA GTCGGAGTTG
TCGGGAGCCA ATAAATTGAC AGCTAATCAG TTGACGACAA GTTGCTTTGT TAGCAGCGCT
GGCGGAAAGC TGGTTGAAAA GACACTTCCG CTGGCGGCAC AGGCATCACC AGTGTTTGCA
CTAACCGCCC TGGATTTCAA TCACGATGGC CGGAAAGACC TGCTGTTATG CGGTAATGCC
AGACAGGCCC GGTTGCGCTT TGGGCGCTCG GAAGCGAACC CCGGACTGCT GCTTCAGGGT
GATGGGAAGG GTGGTTTTCA AGTGGTGCCG CAACCACGTT CAGGTTTTAA GTTGACTGGT
GACGTACGGA GTATAGTAAC CGTCGACAGC GACCGCACAT TCGTCTTTGG GGTTAACCAG
CAGGCTGTGC AGGCGTACCG AATGGCTGGG ACCAAATAA
 
Protein sequence
MKSTTRDCNF LTMSILKKTD FVKTITYLLA LTSLISITGC NQSKSSEETT NKDGPPLFTS 
LTPEQTGITF ANNLTEGLNT NVLMYEYFYN GGGVAVGDLN GDGLEDIYFS GNMVPNQLYI
NKGGMKFSDV TATAGVAGRE GPWRTGVSMA DVNGDGKLDI FVCYSGSLPP HKRIPQLFIN
EGADAQGVPH FSDQTAQYGL DKPEQSTQGA FFDYDLDGDL DLFLLNHNPR LLPILDPDAT
AAIMKQPNPE IGVRLFKNTG NRFEDITEKS GLSSSVLTYG LGIGVSDING DGWPDLYISN
DYGVPDYLYI NNQGGRSGGP VFTNHLKTSI GHTSNFSMGN AVVDINNDSR PDIFTLDMLP
EDNRRQKLLM APDNYEKFDL SVQSGFHYQH MRNMLQINEG VIGSGQQTKG SSSAAAAPLF
SETGQLAGIS NTDWSWSPLF ADYDNDGWKD LYVTNGYVRD YTNLDFLKFM TDYMQNRPAN
FRREDVLELV HRIPSSNVMN YMFRNRGSEP GDEVTFANVG ADWGLTQTSN STGAAYADLD
NDGDLDLIVN NTNQPAFVFK NEANTVRNHH YLAIQLTGGR SNTQGLGTKV TLYRKGKQQF
LEQMPTRGYQ SSMSPRLHFG LGTDALIDSL RIVWPTGKQQ LLTGVKADQL LKLDEKDAVT
LFKNPITAPT LFQEVKSPVV FVDPAKRIND FKRQPLLVNA QSFSGPCLVK ADVNGDGRED
IYAGGSSGEA GALFIQQPSG QFSRLTQPAF EADKLSNNVD AAFFDANADG FPDLYVCSGG
YADLEISDDV RLLDRLYLND GKGKFTKSPN ALPAMQTSSG CVRVADVNGD GRPDVFVGGR
VVPGRYPETP RSYLLVNAGK GPDGQPRFTD QTAQLAPMLA TIGMVTDAAW TDLNGDRKPE
LVIVGEWMPI TVLGMEKGGN STQLTDQTKT YFEKEYRGWW NKLLVNDING DGKPDLVIGN
LGLNTQCHAS DKEPAELIYK DFDNNGKVDP ILCFYIQGKS YPHATRDELL DQLGILRHRF
TNFESYSNAT LTDVFTESEL SGANKLTANQ LTTSCFVSSA GGKLVEKTLP LAAQASPVFA
LTALDFNHDG RKDLLLCGNA RQARLRFGRS EANPGLLLQG DGKGGFQVVP QPRSGFKLTG
DVRSIVTVDS DRTFVFGVNQ QAVQAYRMAG TK
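
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence for comparison with the protein sequence. It is a hedged illustration, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs only in permitting alternative start codons, which does not matter here because the gene starts with ATG).

```python
BASES = "TCAG"
# Standard amino-acid assignment in TCAG order; table 11 uses the same mapping.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
FIRST_LINE = (
    "ATGAAAAGCA CCACCCGCGA CTGTAACTTT TTGACCATGT CAATTCTTAA AAAAACGGAT"
)
print(translate(clean(FIRST_LINE)))  # MKSTTRDCNFLTMSILKKTD
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the GC content field.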