Gene Slin_2637 details


Gene Information

Locus tag: Slin_2637
Symbol:
ID: 8726382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3189438
End bp: 3190613
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003387452
Protein GI: 284037522
COG category:
COG ID:
TIGRFAM ID:
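
The coordinates and lengths in the fields above can be cross-checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3189438
end_bp = 3190613
gene_length_bp = 1176
protein_length_aa = 391

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not
# counted in the protein length.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # coordinates match the gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match the protein
```

Under these assumptions all three checks agree with the listed values: the span is 1176 bp, which is 392 codons, or 391 amino acids plus one stop codon.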


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0859065
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0265177
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGCA AAACGTTAAC ATCCGGTCAG ATCGCCATTA TGGCCGTATC GGCAGGGGTG 
TGTGTCGCCA ATATTTATTA TAATCAACCT ATACTTCCTC ATATGGCCCG CACATTTCAG
GCAACTGAAA ACGAAGTTGG CAGGGTGGCT GTGCTCGCTC AGGCCGGGTA CGGTGTAGGG
CTATTTTTTC TGACACCCCT TGGGGATAAA ATCAATCGAA AACGACTGAT GTTGTTTTTG
CAGATTCTGC TCGTCATCGC ACTTGGGTGT ATGGCATTGG CCAGTAGTTT GTTGCTGGTA
AACATCATGA GTTTTTTTAT TGGCCTTTTC GCCGTTTCGG TGCAGATTAT TGTCCCAATG
GCAGCAAGTC TGGCTAAGGA AAATCGGGGG CGGGTTGTTG GTATTATCTT CACTGGCATT
CTGGTAGGGG TATTATTCGC GAGGGTCTTC AGCGGTTTCA TCGCCGAATG GTTCGGGTGG
CGGTATGTAT ACGGTATTTC GGCGGGGATG GTTGGCGTTG TTGCGCTGGC TCTTCAGCTT
TCGTTGCCCA ATGTGGCTTC TGCATTCACA GGCAGTTACG GACAGTTGCT ATCATCGACG
CTTGCACAGG TTCGCCGTTT TCCGCTACTA CGCAATACGG CTCTGTTGGG GGCTATGGTG
TTCGGAACGT TTTGTTCGTT CTGGACTACG CTTACGTTTC ACCTCAGTGG CCCGCCTTTT
CAGTATCAGA CGGGTACCAT TGGCCTGTTC GGCTTACTAG CCATTGGAGC CGCCCTGCTG
GCGCCGGTAT TCGGTAAACA GGCCGACAAA GGCAATGCTA AACAGATACG TCTGTTTATG
GCGTTCCTGC TCATTTTTAG CGTACTGATC GTAAAAGTAT TTCCGCTCTC AGCTACGGCA
TTCATTCTTA CTGTCCTGCT GCTGGATTTG GGCGCACAGT CTATACAGGT GACGAACACG
GCGCTTATCT ACACACTCGA CAGCACTGCG CATAGCCGTA TCAATACAGT ATATATGACC
TCCTACTTCA TTGGTGGAGC CATTGGAACG TTTGTGGGCA TTCAGTGCTG GGCCTGGGGC
GGATGGACGC TGGTGACCTG GCAATTGTTA CTCTGGAGTA GCCTGGCTAT GCTGGTTTTA
CTGCTTGGTT CACGGCTGAA GACAGCTGAC GCTTAG
 
Protein sequence
MNSKTLTSGQ IAIMAVSAGV CVANIYYNQP ILPHMARTFQ ATENEVGRVA VLAQAGYGVG 
LFFLTPLGDK INRKRLMLFL QILLVIALGC MALASSLLLV NIMSFFIGLF AVSVQIIVPM
AASLAKENRG RVVGIIFTGI LVGVLFARVF SGFIAEWFGW RYVYGISAGM VGVVALALQL
SLPNVASAFT GSYGQLLSST LAQVRRFPLL RNTALLGAMV FGTFCSFWTT LTFHLSGPPF
QYQTGTIGLF GLLAIGAALL APVFGKQADK GNAKQIRLFM AFLLIFSVLI VKVFPLSATA
FILTVLLLDL GAQSIQVTNT ALIYTLDSTA HSRINTVYMT SYFIGGAIGT FVGIQCWAWG
GWTLVTWQLL LWSSLAMLVL LLGSRLKTAD A
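
The sequences above are printed in space-separated blocks of ten residues. The following is a minimal sketch of how such text could be cleaned into a contiguous string and how a GC fraction could be computed from it. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any tool behind this record. Only the first line of the gene sequence is used as input, so the result is not expected to match the 50% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the block above.
first_line = (
    "ATGAATAGCA AAACGTTAAC ATCCGGTCAG ATCGCCATTA TGGCCGTATC GGCAGGGGTG"
)

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on the first line
print(round(gc_fraction(seq), 3))  # GC fraction of that line only
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` would give the value to compare against the GC content field.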