Gene Slin_5704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5704 
Symbol 
ID8729479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6938635 
End bp6940701 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content54% 
IMG OID 
Productoligopeptide transporter, OPT family 
Protein accessionYP_003390468 
Protein GI284040538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.525123 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTAA CCCCACCCGC CCATAAACCC TTTGTTCCGG CAACGGAATC GCCCGCCGAG 
TTTACGCTGA AAGCAGTTAT CACCGGCGCG GTATTTGGCG TCCTCTTTGG CGCAGCAACG
GTTTACCTCT CGCTGAAAGC CGGGCTATCT GTGTCGGCTT CCATTCCCAT CGCGGTGCTG
GCCATTTCCC TCGGCCGTCG ATTTCTGAAT ACGACAATTC TTGAGAATAA TATCATCCAG
ACCACTGGTT CAGCGGGAGA GAGTATTGCA TCCGGAGTAG TGTTTACCAT GCCGGGGTTC
CTCTTCTTGA CCAACGGTTC CGGGGCCGAT TTTTTTAATT ACTGGACGAT TCTTACGCTG
GCTATTCTGG GTGGACTGAT CGGTACGTTG ATGATGATTC CGCTGCGTCG GTCGCTTATT
GTGCAGGAAC ACGGCACCCT GCCGTATCCG GAGGGGACCG CCTGTGCGTC TGTCCTGATT
GCCGGGGAGA AAGGCGGTGA CTTTGCCAAA ACGGCCTACC AGGGGCTGGG GGCAGCACTT
ATCTATGCCT TTCTCCAGAA AGTGCTGCAC ATCATTGCCG AAGTGCCGGT ATGGGCCACG
AAGCAGGCCA ATCGGTATTT CCCATCGGCG CAGGTAGCGG GCGAAATAAC GCCCGAGTAT
CTGGGTGTAG GTTATATTAT TGGTTTCCGG ATTTCGGCGG TGCTGGTCGG TGGCGGTATT
CTGGCCTGGC TGGGGCTTAT TCCGCTCCTG GCTTCTGTTG TGCCGGGCGA CACGATAGCG
TTGCAATTAC AGAAACTGGG CTATCTCGAT AACCTCCAAA CCGTCGGTGG CCCGGGTGGC
TGGAATCCGG CTACGCACAC GTTCTCAGAC ACGGCAGCGG CCATCTATCG GGCATATATC
CGGCAGATTG GGGCTGGTGC CGTAACCGCC GGTGGGTTCA TGACGCTTAT AAAAACCATT
CCAACCATTG TCTCTTCGTT TAAGCAGAGT TTCAGCAAAA CCTCGATCAA CGCCATCGAA
GCCGAAAATG CGATGAGCGG ACACAGCGGC ATTCCCCGAA CAGAACAGGA CTTGAGCGTG
CGGATCGTTA TTATTGGTAG CATTGTGCTG GCAGCTCTGA TGGTTATATT ACCCCAAATC
CCCGGCGATT CTATATTGAC TAAATTACTG ATCGCTGTAC TTGTCATTGT ATTCGGCTTC
TTTTTCGTTA CTGTGGCCAG CCGAATTGTG GGGCTGATCG GGTCAAGCTC GTCACCCGTT
TCTGGAATGA CCATTGCCAC GATCATGGGC ACGTCGCTGG TGTTCATTGG CTTCGGTCTG
ACCGGGAAAC TGTATGAACC CGCCGTTTTG GTTGTGGGAA GCATGATTTG TGTGGCGGCT
GCCAATGCCG GTGCAACCTC GCAGGATCTC AAAACCGGAT ACATCGTCGG CGCGACGCCG
AAATACCAAC AAATGGCCTT GTTTATCGGC GTCATAGTAT CGTCGCTGGT CGTGGGAGCA
ACGGTCAAGA TTCTGGACAC CCCCACGCCT GATCTGCTGG CGCAGGGTAT TACGCACGCC
ATTGGCTCCG ATAAGTTCCC GGCTCCGCAG GGTACACTGA TGGCCACCCT CATCAAAGGT
TTATTGTCAT TCAACCTCGA CTGGCAGTTT GTGCTCGTAG GTGCCTTTAT TGCTTTTGTG
TTCGAGTTAG TGGGCGTGAG TGCGCTGGCG TTTGCGGTGG GCTTGTATTT ACCGCTGTCG
ACTACACTAC CCATTTTTGC CGGTGGTCTG GTCAAAGCCA TCGTCGACGC CAACAAGAAA
CGGAAAGGTA TTGTGGAGGA AGATGCCGAT CTGGGTAAGG GAAATCTGTT TGCAACGGGT
CTGGTTGCCG GAGGAGCCTT AGCAGGGGTA GCCGTAGCCT TGCTATCCGT CAACGCATCC
ATCTACAATG GGCTGGCCTC GCTTACGCTG GAGCCTTTGC TGGGTAGCGC ATTGGGAGAA
GGCGGCTACA TGCTGCTGGG TGCGTTGTTG TTCGTTGGTC TGGCAATTGT ACTATGGCGC
GTAGGTGAAC AGAAACAGGA AGGCTGA
 
Protein sequence
MALTPPAHKP FVPATESPAE FTLKAVITGA VFGVLFGAAT VYLSLKAGLS VSASIPIAVL 
AISLGRRFLN TTILENNIIQ TTGSAGESIA SGVVFTMPGF LFLTNGSGAD FFNYWTILTL
AILGGLIGTL MMIPLRRSLI VQEHGTLPYP EGTACASVLI AGEKGGDFAK TAYQGLGAAL
IYAFLQKVLH IIAEVPVWAT KQANRYFPSA QVAGEITPEY LGVGYIIGFR ISAVLVGGGI
LAWLGLIPLL ASVVPGDTIA LQLQKLGYLD NLQTVGGPGG WNPATHTFSD TAAAIYRAYI
RQIGAGAVTA GGFMTLIKTI PTIVSSFKQS FSKTSINAIE AENAMSGHSG IPRTEQDLSV
RIVIIGSIVL AALMVILPQI PGDSILTKLL IAVLVIVFGF FFVTVASRIV GLIGSSSSPV
SGMTIATIMG TSLVFIGFGL TGKLYEPAVL VVGSMICVAA ANAGATSQDL KTGYIVGATP
KYQQMALFIG VIVSSLVVGA TVKILDTPTP DLLAQGITHA IGSDKFPAPQ GTLMATLIKG
LLSFNLDWQF VLVGAFIAFV FELVGVSALA FAVGLYLPLS TTLPIFAGGL VKAIVDANKK
RKGIVEEDAD LGKGNLFATG LVAGGALAGV AVALLSVNAS IYNGLASLTL EPLLGSALGE
GGYMLLGALL FVGLAIVLWR VGEQKQEG