Gene Slin_4639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4639 
Symbol 
ID8728403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5646738 
End bp5649113 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content57% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003389416 
Protein GI284039486 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGAA ACTACCTCAA AACGGCCCTT CGAAATTTAT GGAAACACAA GCTGTTTTCA 
TTCATCAATG TCTTTGGGCT AGCGTCGGGC ATGCTGGTGT GTTTGCTGGC CATGATCGAC
ATCAAAGGCG CGTTCGATTA TGATTCGTTT CATCCACACA CCGACCGGAC ATACCGCATT
CTGACCGATG TGACCGACAA AGCGAACGAT GAACGGGCGT TTGCCACTAC CCCCCTGCCG
CTGGCCGACG ACCTCAGCCG AAACTACCCG TTTGTGGAGG CCACCACGCG GGTTATCCGC
CAGTATGGGG AAGCCACCGC GAATCGAAAG CAACTTCAGG TGATAGCCAG CGCCGTCGAC
CCCGGATTCT TTACCGTCTT CGGGTTCAGG CTGGCAGCGG GTCAGGCAGC CACCGCCCCC
GGAACGGTCG TGCTCACCAG ACAAACGGCC GAACGGTTTT TCGGGACAGC AAACCCGGTG
GGAAAGGTGC TGGAACATAC TGGATTAGGA CCACTGACGA TTACGGGCGT TTTTGCCAAA
CCCCCAACCA AAACCCACCT CAACTTCGAT ATGGTGGTGT CGATGGCCAC CCTATCCACC
CCGGACTGGC AGCGCAAACG GGCCGACTGG ACGCAGTATT CGCAGGGCTA CACATACGTT
CTGCTCAAGC CGAACACCCC AACCGAAACA CTGGAAGCTT CCCTCCCGGC CCTGGCCAGC
CGGGTTACGA CCGGCATCCG GTTTGCTACT GAAAAAGGGT ATACCTTCCG TACGCAGGCC
CTGGCCAGGA TTTCGCCCTC GCGCGAGGAC CTTATGTATG CTACGTACGA ACCCACCGCC
GGAAAGCTGG AAGCCGAACT GGGCGTTGGC TTACTGACGT TGCTGCTGGC AGCTTTCAAT
TACATCAACC TCACTCTGGC CCGGTCGCTG AGCCGCGCCC GCGAAGTGGG CATCCGGAAA
GTGGCCGGGG CAATGCGCTG GCAGTTGATG GGGCAGTTCA TGGCCGAATC CGTCATTTTG
TCGGTGCTGG GCCTTGGGCT GGCGTATGGT ATGCTACAAC TGGTAAAACC CATGCCTTTT
GTTCAGCAAT GGCTCATTGG CGATAGTCAA TGGGAAACGA ACAGCACCCT TTGGACGGTG
TTCGTAGTAT TCAGCGTGGT AACGGGCTTG CTGGCGGGCT TGTTACCGGC CCGCGTACTG
TCGGGATTTC AACCGGCGCA GGTACTCCGC AGCCAGACCG GTCTGAAAGC GTTCAGGGGT
GTAACGTTAC GTAAATCCCT GATTGTGGCG CAATTCTCCA TTTCGTTACT CGCCATGATC
GCGCTGCTGG CTATGGCCCG ACAGCAGCAG TTTATGGCCA CGGCCGACTA CGGCTTTCAG
CGGGAAGGGC TGTTGACCAT TCCGCTGAAT GGAATGCCAC CGGCCCGCCT TTCGGCCCAG
ATCAGCCAAT TGGCGGGTGT GGACCGGGTA GCCGCAACCG TCGCGCTATT TGGTGATCAC
GGCGGAAACT GGCAAAAAGT GTATCGGCAG AAAGCCAAAA GCGATTCCTC GATAACCGAT
GTTTTTGCCG CCGATGCCAA CCTGATTCCT ACCGCCGGGC TAACCTTGGT GGCCGGACAG
AACATGCCAC AATCTGCATC CGATACGGCC TCCAACCAGG TTCTTATCAA CGAGGAAGCC
GTGAGGACCT TTAAACTGGG CGAACCCAAA GCGGCTGTCG GGCAAACGCT CTGGCTCAGC
GACAGCACGG AGGTGCAGAT TGCCGGTGTT GTGAAAGATT TCCAGTTTAC GACGATGGTC
TGGAAAATCC GTCCGTTGAT ACTCCGCTAT CAACCCGGTG ACTTCCGGTA CCTGACAGTG
AAAGTTGCGG GGGGAAATCC CGAATCCGTC AAAGCCGACA TCGCCCGTAT CTGGAAACGG
CTCAACCCCT ACGAACCCTT CGCCGGGCAG TGGTACGACG ATTTCCTGTA TAACCGGCAC
AGCCATACCG ACGATCTGAG TTTTATGGGC TTGCTCCTTG GCCTGGCCAT GTCGATTGCC
TGCCTGGGCC TGTTGGGGAT GGTGACCTAC ACCACCGCCC TAAGGACCAA AGAAGTGGGG
GTTCGGAAAG TGATGGGTGC CAGCGTTGGG CAGGTGGTGT GGCTGCTGTC GTGGGATTTC
CTGCGTCTGC TGCTCATTGC CGGTACCATT GCCATGCCAT TGGGGTACCT GGCCAGCAGC
TTCTTCCTGA TGACATTCGC CTATCACATT ACGGTAGGCG TCGGACTGCT GGGGCTGTGC
TTCGGCACAA TGCTCCTGCT GGGTGGCCTG ACCATTAGCT GGCGAACATA CCGGACAGCC
CTGACCAACC CGGTGAATAG TCTTCGAAAT GAATAA
 
Protein sequence
MLRNYLKTAL RNLWKHKLFS FINVFGLASG MLVCLLAMID IKGAFDYDSF HPHTDRTYRI 
LTDVTDKAND ERAFATTPLP LADDLSRNYP FVEATTRVIR QYGEATANRK QLQVIASAVD
PGFFTVFGFR LAAGQAATAP GTVVLTRQTA ERFFGTANPV GKVLEHTGLG PLTITGVFAK
PPTKTHLNFD MVVSMATLST PDWQRKRADW TQYSQGYTYV LLKPNTPTET LEASLPALAS
RVTTGIRFAT EKGYTFRTQA LARISPSRED LMYATYEPTA GKLEAELGVG LLTLLLAAFN
YINLTLARSL SRAREVGIRK VAGAMRWQLM GQFMAESVIL SVLGLGLAYG MLQLVKPMPF
VQQWLIGDSQ WETNSTLWTV FVVFSVVTGL LAGLLPARVL SGFQPAQVLR SQTGLKAFRG
VTLRKSLIVA QFSISLLAMI ALLAMARQQQ FMATADYGFQ REGLLTIPLN GMPPARLSAQ
ISQLAGVDRV AATVALFGDH GGNWQKVYRQ KAKSDSSITD VFAADANLIP TAGLTLVAGQ
NMPQSASDTA SNQVLINEEA VRTFKLGEPK AAVGQTLWLS DSTEVQIAGV VKDFQFTTMV
WKIRPLILRY QPGDFRYLTV KVAGGNPESV KADIARIWKR LNPYEPFAGQ WYDDFLYNRH
SHTDDLSFMG LLLGLAMSIA CLGLLGMVTY TTALRTKEVG VRKVMGASVG QVVWLLSWDF
LRLLLIAGTI AMPLGYLASS FFLMTFAYHI TVGVGLLGLC FGTMLLLGGL TISWRTYRTA
LTNPVNSLRN E