Gene Slin_4593 details

Gene Information

Locus tag: Slin_4593
Symbol:
ID: 8728357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5566581
End bp: 5568998
Gene Length: 2418 bp
Protein Length: 805 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: protein of unknown function DUF214
Protein accession: YP_003389371
Protein GI: 284039441
COG category:
COG ID:
TIGRFAM ID:
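
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record above.
start_bp = 5566581
end_bp = 5568998

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of 3 bases.
codons = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, protein_length)  # 2418 805
```

Both values agree with the Gene Length and Protein Length fields of the record.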


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCGCA ATTACCTGAA AGTCGCCCTG CGAACGCTCT GGAAACACCG CACGCACACA 
CTCATCAACA TCGTTGGGCT GTCGGTGGCG TTCGGTACCT GCGTGCTGTT GTTCCTGACG
GCTACCTTCG AACTCTCCTA CGACAGCTTT CATACCGATG CCGACCGCAT CTTCCGGCTG
AACTTTCTGT CGACCAACCG CGATGGAACG ACGGATAGAG GCAGCACCAT GCCGTATCCC
ATCTCCCCCG CCCTCAAAGC CGAATTTCCG GAGATTGAAG GGGTTACGCG TTGGTTCGAC
CGGAGCGCGA GTATCCGGCG CAATAGCCAG ACCTACACCA AGGATGTCCG CATGGCCGAT
GCGGATTTTC TTCATATGTT CTCCTTCCCG CTCCAGAAAG GCAACCCCAA AACGGCCATG
AACGGTCTGA GCGACATTGT CATCAGCGAA CGTATGGCGA ACGATATTTT TGGGAAGGAA
GACCCCGTGG GCAAGCCCCT CCAGCTCCGT ATGAACGGCG CCTGGCAGGC ATTTACCGTG
ACGGGCGTAA TAAGCAATCC TCCAAAAAAC TCAACCTTCG ATTTTGATGC ACTCATCCGC
AGCGACAACG CGGGCGATTA TCAGGAGTTC AAAAGCCGCT GGGACCACGG CAACCATGAT
GTATATGTAC AGGTAAAAGC CGGTACCGAC GCGCAAACCC TGCAACGCCG GACGCAGGCT
TTCATGGACA AATACTTCGC GAAGGATATT AAAGAGCGGC AGGAACAGGG GTATCCAAAA
AATGAACTGG GTTACCAGCG CAGCCTTCTT CTGGAGCCCT TGCGCGATGT CCATTTTGAC
ACCGTCACCA CACATGGTGC TGGTATCAGT CGGGCGTACG TGTATACATT GCTGCTCATC
GGCCTGTTCA TTCTCGCCAT TGCCTGTATC AACTTCATTA ACCTCACCAT TGCGCAATCG
CTCTCGCGGG CTCGGGAAGT GGGCGTTCGT AAATCGCTGG GGGCTCAACG GGCACAGCTG
TTTGGCCAGA TCTGGGGTGA AACCCTGTTG CTGTGCTTCG GCGCATTAGT CATTGGCCTG
GGGTTGGCGT ATGCTGTGCT ACCCACCTTC AATCGCCTGT TCCGAAGCTA TCTGACACTG
GACAACTTCC TGACACCAAC CGTATTGCTG GTAACGGCAT TGTGCTTTCT GCTCATCACG
CTCATAGCGG GCGGCTATCC ATCCTGGTTT GTGACGCGCT TCAATGCTGT GGAGGTCCTG
AAAGGCCGGG TGAAGGTGAG CAAACCGGGC GTACTTCGCA ATTCACTCAT CATCACTCAG
TTTACCATAG CCTGTCTGCT TATCGTCTGC ACCATAATAG TCCGGCAGCA GATCACCTAT
TTGCAGCAGC GACCGATGGG CATGGACAAA GAACAGGTTA TCAGCGTACC GGTGGGTGGC
GAACTCAACG GCACCGTCGC CCTGAAAGCT ATGCGCGACC GGCTGGCCAA CCAACCCAAC
ATCACCGCCG TATCGGGTTC GGGCGTGAAC ATCGGCGCGG GACTGGACGG TAGCTCATCC
CGAATGATGT TCGGTTTCCA ATACGGCAAA CGGGACGTTA CCTGCGACTG GCTCCGCATC
GACACGGATT ACCTGAAAAC AATGGGTATC AAACTCCTGA AGGGCCGCGA TTTCAGCCCG
GACTTCAGTA CGGATTCCAG CTCGGCGGTG CTGATTACCC AGAGTATGGC GAAGGCACTA
GGCGAAGCAA ATCCTATAGG AAAATTCATT AAGCCCGATA ATAAATCGTA CCAGATTGTG
GGTGTCGTTT CCGATTTCAA CCTGTACTCC CTGCATCAGG AAGCCAAACC GATTACCTTG
CAAATGGAGT CGAGCGCACC CATTCAGTAC ATTCTTGTCC GGGTAAATCC GCAGAATCTA
ACGGGCGCAA TGGAAACCAT CAAGACTGCC TGGAAGACCA TCGCGCCCAA ACAGGAGTTC
ATCGGCTCGT TTCTGGATGA AAACACCGAA CGCTGGTATC GGAAAGAACA GCGGTTATCG
ACCATCTTTT CGACCGCTGC GGGCATTGCC ATTTTGCTCT CGTGCATGGG TTTGTTCTCC
ATCGCTCTGC TCACCATCGA ACAGCGCACC AAAGAGATTG GCGTTCGGAA AGTGCTGGGT
GCCAGCGTAG CCAGTATTGT GGCCCTGCTC TCAAAAGACT TTCTAAAACT GGTTGTAGCG
GCCATCGTCA TTGCCTCACC TCTGGCATGG TGGGCCATGG ACAACTGGCT TCAGGATTTC
GCCTATAAAA TTGATATTGC CTGGTGGGTC TTTGCGGTAG CGGGTTTGCT GGCGGTTGTG
ATTGCGCTGG CAACCGTAAG CTTCCAGAGT ATCAAAGCCG CCTTGATGAA CCCAGTGCAA
TCGTTACGGT CCGAATGA
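
The gene sequence above is laid out in blocks of 10 bases separated by spaces and line breaks. A minimal sketch of how such a listing could be joined into one string and its GC content computed; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first three blocks of the listing are used here as sample input:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Sample input: the first three 10-base blocks of the gene sequence.
sample = "ATGTTCCGCA ATTACCTGAA AGTCGCCCTG"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full listing into `sample` should give a length that matches the Gene Length field and a value comparable to the GC content field.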
 
Protein sequence
MFRNYLKVAL RTLWKHRTHT LINIVGLSVA FGTCVLLFLT ATFELSYDSF HTDADRIFRL 
NFLSTNRDGT TDRGSTMPYP ISPALKAEFP EIEGVTRWFD RSASIRRNSQ TYTKDVRMAD
ADFLHMFSFP LQKGNPKTAM NGLSDIVISE RMANDIFGKE DPVGKPLQLR MNGAWQAFTV
TGVISNPPKN STFDFDALIR SDNAGDYQEF KSRWDHGNHD VYVQVKAGTD AQTLQRRTQA
FMDKYFAKDI KERQEQGYPK NELGYQRSLL LEPLRDVHFD TVTTHGAGIS RAYVYTLLLI
GLFILAIACI NFINLTIAQS LSRAREVGVR KSLGAQRAQL FGQIWGETLL LCFGALVIGL
GLAYAVLPTF NRLFRSYLTL DNFLTPTVLL VTALCFLLIT LIAGGYPSWF VTRFNAVEVL
KGRVKVSKPG VLRNSLIITQ FTIACLLIVC TIIVRQQITY LQQRPMGMDK EQVISVPVGG
ELNGTVALKA MRDRLANQPN ITAVSGSGVN IGAGLDGSSS RMMFGFQYGK RDVTCDWLRI
DTDYLKTMGI KLLKGRDFSP DFSTDSSSAV LITQSMAKAL GEANPIGKFI KPDNKSYQIV
GVVSDFNLYS LHQEAKPITL QMESSAPIQY ILVRVNPQNL TGAMETIKTA WKTIAPKQEF
IGSFLDENTE RWYRKEQRLS TIFSTAAGIA ILLSCMGLFS IALLTIEQRT KEIGVRKVLG
ASVASIVALL SKDFLKLVVA AIVIASPLAW WAMDNWLQDF AYKIDIAWWV FAVAGLLAVV
IALATVSFQS IKAALMNPVQ SLRSE
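
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation, assuming the codon assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG); `translate` and `CODON_TABLE` are hypothetical names:

```python
from itertools import product

# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Unrecognized codons (e.g. containing N) are reported as X.
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven and last six codons of the gene sequence.
print(translate("ATGTTCCGCAATTACCTGAAA"))  # MFRNYLK
print(translate("TCGTTACGGTCCGAATGA"))     # SLRSE
```

The two outputs match the beginning and end of the protein sequence listed above.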