Gene Slin_1203 details

Gene Information

Locus tag: Slin_1203
Symbol:
ID: 8724936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1463367
End bp: 1464860
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: outer membrane efflux protein
Protein accession: YP_003386053
Protein GI: 284036123
COG category:
COG ID:
TIGRFAM ID:
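
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1463367
end_bp = 1464860

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1494
print(protein_length)  # 497
```

Both results agree with the Gene Length and Protein Length fields listed in the record.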


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGTGG CGTGGAAACA GATGCGGGTT GTAGTAGCCC TATTAGTAAC GGCAAATTTG 
TCGGCAACAT ACGCACAGAG TACGGGCTCG GCCGTATCGC CGGGCACCGG AGCCGTTGCA
CCAGCCCGGG TAAACTTACA GCAATGTATT GAGATCGCCC AGCAGAACAA TATTCAGATT
CGGCAGGGGC GGTTAACGAT TGCCAACAGC GAGCTGCAAC TGCATCAGGC GCGGCTAAAC
CAATTGCCAA CTGCCGTTTT TCAAGGTAAC CAATCGTTAA ATGGGGGGCG TAGTATTAAC
CCTCAATCCA ACGAATTCGT CCAGCAAACG ATTAACTCCA GTAGTTTTCA GCTCAACTCA
TCAGCGACGC TTTACAACGG TGGTGTGTTG CGCAGCACGG TCAGGCAAAA CGAATTGGCA
CTTCAGGCGG GTCAGCAGGA GTTAAATGCC ACCCAAAATA ATGTCTCGCT AACGGTTGCT
CAAAACTACC TGAACGTGCT GACGGGCACG GAACAATTGG CGGTGGCGCA ACGCCAGGCC
GATGTAACGC GGGCGCAGCT CGAACGTACG CAACGACTGG TCAACGCAGG TTCTGCTCCG
GAAGCCAATC TATTTGAACT TCGGGCTACC TTAGCCAGCA ACGAACTGGA GATCATAAAT
GCTCAGAATA CGCTTGATCT GGCCAAAGTT GCTTTGTTGC AAGCCATGAA TGTGCCAATC
AATCAGGATT TTGAGGTCGA ACCAATTACC GTTCCCGATC CTGGTCTAGA CCCTTACACA
GCGTCTGTAC AGCAACTTTA TGATGTGGCT TCTGCAAACC TGCCAGAAAT TAAAGGCGCA
GATTTGCGGG TTAAAAGTGC CAATTTAGGG GTACAGGTTG CCAAAGGCGG ATTGTATCCG
ACGCTATCAT TAAACGGTAA CCTGAGTACT GTTTATTCCA GCGCAGCTAA AACGGCCGTT
CCCAATGGAC AATCAACGCA GCAAACGATT GGTTTTGTTA CTGATCCCGT TACCGGAACC
CAGATACCCA TTAATACCTC AGTACCGGGT TATGACCGCA CTGGTATCTC TTATGGCACG
CAGTTAAGTA ATAACTTTAG CCAGTCTGTT TCACTTTTTC TGCGGGTTCC GATTTTTCAG
GGCAACTTAT CGCGGAACCG GATTACTACG GCTAAAATCC AACAGCAAAA CGCTGAGTTA
ACCGCCCTGA ACACCCGTCT AACACTACGT CAGCAGATTG AAACGGCGTA TACGCAGGTG
AAAGCCGGAG CGAATCGGTA CCGGGCGACG CAAGCACAGG TTGCCTCGCT GGAGAAGGCG
TTTCAGGTCG CAGAGAGTCG GTTAAATGCC GGTGCCATTA ATGCAACGGA TTACAGCATT
GCCAAAACGA ACCTCGACCG GGCTCGTGCC AGCCTGGTAC AGGCGAAATA CGATTACGTG
TTCCGGACAA AAATTTTGGA TTACTATCAA AATAAACCGC TTAGTTTTAA TTAA
 
Protein sequence
MQVAWKQMRV VVALLVTANL SATYAQSTGS AVSPGTGAVA PARVNLQQCI EIAQQNNIQI 
RQGRLTIANS ELQLHQARLN QLPTAVFQGN QSLNGGRSIN PQSNEFVQQT INSSSFQLNS
SATLYNGGVL RSTVRQNELA LQAGQQELNA TQNNVSLTVA QNYLNVLTGT EQLAVAQRQA
DVTRAQLERT QRLVNAGSAP EANLFELRAT LASNELEIIN AQNTLDLAKV ALLQAMNVPI
NQDFEVEPIT VPDPGLDPYT ASVQQLYDVA SANLPEIKGA DLRVKSANLG VQVAKGGLYP
TLSLNGNLST VYSSAAKTAV PNGQSTQQTI GFVTDPVTGT QIPINTSVPG YDRTGISYGT
QLSNNFSQSV SLFLRVPIFQ GNLSRNRITT AKIQQQNAEL TALNTRLTLR QQIETAYTQV
KAGANRYRAT QAQVASLEKA FQVAESRLNA GAINATDYSI AKTNLDRARA SLVQAKYDYV
FRTKILDYYQ NKPLSFN
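
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the coding sequence. It assumes the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with table 1 (the two differ in which start codons are permitted). The helper names `clean`, `gc_content`, and `translate` are hypothetical and are not taken from any database tool.

```python
# Codon table built in TCAG order. Assumption: table 11 assigns the same
# amino acids as the standard code; alternative start codons (e.g. GTG, TTG)
# are not special-cased here, which is fine for this gene since it starts
# with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence as printed in this record.
first_row = "ATGCAAGTGG CGTGGAAACA GATGCGGGTT GTAGTAGCCC TATTAGTAAC GGCAAATTTG"
print(translate(first_row))  # MQVAWKQMRVVVALLVTANL
```

The translated first row matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared against the GC content field.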