Gene Slin_4598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4598 
Symbol 
ID8728362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5576972 
End bp5579326 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content54% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003389375 
Protein GI284039445 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.665219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCTA ACTACGTCAA AATCGCTCGT CGGCATTTGT GGACGAACAA ACTCTATACC 
GGTCTCAATG CCGGAGGGCT GGCCGTTGGG CTGACGGCCT GCCTGCTCAT GGTGCTGTAT
GTGAAGCATG AATTCACCTA CGACCGCTTT CACACCAAGG CCGACCGGAT CGCTCGGGTC
ACGACGAAGC TGACCACACC CGAGGTACCC ATTGTCGTAG CCTCCTCCTC CATCTTACTG
GCCAGCGCCT TGAAACGGGA TTACCCGGAA GTCGAAACAG CCGCCCGATT CGAGCCGGTG
TCGGCCACCA TTAGGTATGG GACCGATTTG CGGAACGAAC CCGATGTGTA CTTTGCCGAA
CCGGCCATTT TCCAGGTGTT TACTTACCCG TTCGTCGAAG GAGATGCGGC ACAAGCGTTG
ACCGAGCCAA ACACCGCCGT CGTTACTGAG AGTTTTGCCC GAAAATACGC CGGACGGACA
AGTGTACTGG GCGAGACATT CCTGTGCAAC AAAAAACTGT ACCGCATTAC AGGCGTCATG
GCCGATTTGC CGTCCAACGC CGACATGAAA ATCAGCGCGT TGCTGGCTAA AGATTACACA
ACCTACACCG ACTGGCTGGT GGATGACTTT CCGGTCTATA CGTTCGTTCT GTTCCGGCAA
ATTCCTAATC TGAACGTCTT CGAGAAGAAA CTGGCCTTAC TCAGCAAAAC CTACATCCAG
CCGGAACTCA AGAAAATAGG GGCAACGGGC TATGCCGTTG TTTTCCAAAC CGAGTTGCTG
AAAGACGTCC ACTTCAGTCA GGGGAAAATG GCTGATATGC CGAAGGGCAA TAAACAATAT
GGCTATATCT TCCTGTTTCT GGCCGTTTTT GTGCTGGTAA TCGCCTTGCT GAACTACATC
AACCTGCTGA CGGCCCGCGC CACCGGGCGA GCGAAGGAAG TAGGCGTCCG AAAAGCCAGC
GGAGCCTTGC GGACACAGTT GATCGGCCAG TTTTTACTGG AGTCGTTCCT GTTGAGCTGG
CTGGCGGTCG GGCTGGCGAT CGTCCTGCTG GCAGTCAGCA TTCCTTTTTT CAACGACTTG
CTGCAAGTTC AGCTTACGGT CGGCTGGCCC GACGGGTTTC TGATGGCGGG CGTGGCGGTA
GCAAGTACAA CCCTGCTGGG CGGACTCTAT CCGGCCTTTG TCCTGTCCGG CTTTGACCCG
GCGACTATCT TACGTAAACA GGCGGGCGGG TTGGGTCGCG GCTTTGGGCT TCGGCAAACG
ATCACCGTAT TTCAGTTTAT ACTGGCGGTG GGCATGATGA TTGGTGTACT GGTGGCCCAT
AGTCAGATGA ACTACATGCA GCGCGTTGAT CTGGGGTTCA CAAAAGAGCA AGTGCTAACC
GTTCACCTCC CCGACGATTC GCTGGCCAGA ACCAGGGGCT ATGCCTTCGC CCAGGCGTTG
CGGCAACGCA CCGAAATCAG GGATGCATCG TTGGGATCGG GGATTAAGCC CGATGCCATA
CTGGTCAAAG CAACGACCCT ATTTCAATCG GCGGGCAAAA AGCGGGAAGT CATGGGCAAT
TATTTGTCCA TCGATGATCG TTTTCTGCCG TTGCTGAACC TGAAACTGGC GATTGGCCGG
AATCTATCGG CGGACTCGGA AGCCGACAAG AACGGAGCCT TTCTGGTCAA CGAAGCCTTT
GTTAAACAAG CGGGCTGGAA ACAGGCCGTT GGTCAGCCTA TGGAGGGATT TATGCACAAG
GGTAAGGTAA TTGGCGTGGT CAGGAATTTC CATTTCCATT CCCTGCATAC AGCCATTGAA
CCGGTCATAT TGGTTTTCAA CACCAATCCA CCCGCCAACC TGACGCTGAA AATGAAGCCG
GAACAATTGC CGCTCGTACG AGCAACCTGG AAGCAGCACT ATCCGAACTT CCCTTTCGAT
TACACGTTTC TGGACGAGGC CTTTGCCGCC CAATACCGTA AAGACGAGTT GATGATCATT
CTTTTCAACG GATTTTCATT GCTAACTATA CTGGTTTCCT GTCTGGGCCT GTTCGGTCTG
GCGACCTACT CGGCCGAGCA ACGAACCAAG GAAATCGGGG TGCGTAAAGT ACTGGGTGCA
AGTGTCCTCA GCATCGTGGC ACTTCTGTCG AAAGATGTCT TTAAACTGGT CCTCATCGCC
ATTGTCATTG CCTCTCCCCT GGCCTGGTAC GCTATGAATA AATGGCTGAC CGACTTTGCC
TACAAAATCG ACATTAGCTG GTGGATGTTT GTGCTGGCGG GTGTGCTGGC CCTGGGTGTT
GCCCTGCTAA CCATGAGTTT TCAGAGCATA AAAGCTGCGC GGATGAATCC AGTGAAATCA
TTACGGACGG AATAG
 
Protein sequence
MIANYVKIAR RHLWTNKLYT GLNAGGLAVG LTACLLMVLY VKHEFTYDRF HTKADRIARV 
TTKLTTPEVP IVVASSSILL ASALKRDYPE VETAARFEPV SATIRYGTDL RNEPDVYFAE
PAIFQVFTYP FVEGDAAQAL TEPNTAVVTE SFARKYAGRT SVLGETFLCN KKLYRITGVM
ADLPSNADMK ISALLAKDYT TYTDWLVDDF PVYTFVLFRQ IPNLNVFEKK LALLSKTYIQ
PELKKIGATG YAVVFQTELL KDVHFSQGKM ADMPKGNKQY GYIFLFLAVF VLVIALLNYI
NLLTARATGR AKEVGVRKAS GALRTQLIGQ FLLESFLLSW LAVGLAIVLL AVSIPFFNDL
LQVQLTVGWP DGFLMAGVAV ASTTLLGGLY PAFVLSGFDP ATILRKQAGG LGRGFGLRQT
ITVFQFILAV GMMIGVLVAH SQMNYMQRVD LGFTKEQVLT VHLPDDSLAR TRGYAFAQAL
RQRTEIRDAS LGSGIKPDAI LVKATTLFQS AGKKREVMGN YLSIDDRFLP LLNLKLAIGR
NLSADSEADK NGAFLVNEAF VKQAGWKQAV GQPMEGFMHK GKVIGVVRNF HFHSLHTAIE
PVILVFNTNP PANLTLKMKP EQLPLVRATW KQHYPNFPFD YTFLDEAFAA QYRKDELMII
LFNGFSLLTI LVSCLGLFGL ATYSAEQRTK EIGVRKVLGA SVLSIVALLS KDVFKLVLIA
IVIASPLAWY AMNKWLTDFA YKIDISWWMF VLAGVLALGV ALLTMSFQSI KAARMNPVKS
LRTE