Gene Slin_3423 details

Gene Information

Locus tag: Slin_3423
Symbol:
ID: 8727176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4150299
End bp: 4151507
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003388230
Protein GI: 284038300
COG category:
COG ID:
TIGRFAM ID:
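
The coordinates and lengths listed above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, since the page does not state it. A minimal Python sketch of the check, with hypothetical variable names:

```python
# Values copied from the Gene Information table.
start_bp = 4150299
end_bp = 4151507
gene_length_bp = 1209
protein_length_aa = 402

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, remainder, residues)  # 1209 0 402
```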


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACAA CAGCCTCTCT GAATAAACCA AATCGAAACC TGTTCATTCA AAACCTGCTG 
ACGTTCTTTC TGAACCGGCA GGCGCTGGCC GTTGGGCTGG TCTTTGCTTC CGATAGTATT
CTGTTCGGCA GTTGGGTTGC CCACATTCCG TTTGTCAAAG CCAAACTCCA TCTTTCTGAT
GCTGAACTGG GACTAACGCT GTTTGCCATG CCCATTGGTT TACTGGTCAT GAATCCGCTA
ACCGGCTGGA TTATTGCCCG GCTTGGTGAA GCCCGCGCCT GTTTCTGGTC GGCGGTAGGT
TTGACATTAG CCGTTTGCAT CCCGCTCAAC GCGCCAAACC CGATAGTTCT TTGCCTTGGT
TTATTCCTGA TGGGATTGAA TGCAGCCCTC ATCAACGTGG CCATGAACAC CACCGCTACA
AATCTGGAAC GGGCACAGGG TATTGTTATC ATGTCGTCGT GCCATGGCAT GTGGAGTCTG
GGCGGATTGT TTGGCTCAGG TATTGCAGGG GCTGTTATCG CGTTGCATGT ATCGCCCCCC
ATTCATATCA TGATCATGGC AGGCTTAATT CTGCTCATGA CTTTTATTCT TCAACCGATT
CTGGCAAAAA TCCCGTCGAG TAGCCGAACA GAAACGGGCG AGAAAGCCGG GTCGTCATTC
GTTCGACCTA ATCTGGACTT GTTGTTAATG ATTTTGATCG GCTTGTCGCT GGCCATGGGC
GAAGGGGCCG CTTTCGACTG GAGCGCTGTG TATTTGCGTG AGACACTTGG CGCCAGCAGT
CAGATTGCCG CGCTGGGCTT TGGCGCGTTC TCGCTCACTA TGACCGGTTT CCGTTTCCTG
GGCGATGCGA TCATCCCTAA AATAGGTGCC AAACGCTGGC TACAGATTGG GGGAGTTATT
GGGGCAGCGG GTCTCTTATT CGCCATCGCT CTCCCCTACC CGGCAACGGC GTTGATTGGC
TTTGGCGTGC TGGGAGCAGG TTGCTCGTTG GGTGCACCGG TGCTGTATGC GGCCTCTATG
CGCGTAGAGG GCATTCCGCC CGCAGCTGGC CTGGCTACGT TTGCGACATT TAGTTTTATT
GGCTTTCTGG CAGGTCCGCC CATCATCGGG TTTGTAGCCG AAGCCTTTGG ACTGGTGTAT
GGATTAGGCT TCGTAGCCAT CATGCTACTG ATTTCGGCCG GTTTGGCCAA ACTAGTAAAG
CTATTTTGA
 
Protein sequence
MNTTASLNKP NRNLFIQNLL TFFLNRQALA VGLVFASDSI LFGSWVAHIP FVKAKLHLSD 
AELGLTLFAM PIGLLVMNPL TGWIIARLGE ARACFWSAVG LTLAVCIPLN APNPIVLCLG
LFLMGLNAAL INVAMNTTAT NLERAQGIVI MSSCHGMWSL GGLFGSGIAG AVIALHVSPP
IHIMIMAGLI LLMTFILQPI LAKIPSSSRT ETGEKAGSSF VRPNLDLLLM ILIGLSLAMG
EGAAFDWSAV YLRETLGASS QIAALGFGAF SLTMTGFRFL GDAIIPKIGA KRWLQIGGVI
GAAGLLFAIA LPYPATALIG FGVLGAGCSL GAPVLYAASM RVEGIPPAAG LATFATFSFI
GFLAGPPIIG FVAEAFGLVY GLGFVAIMLL ISAGLAKLVK LF
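
The protein sequence above can be reproduced from the gene sequence by removing the spaces and line breaks and translating it codon by codon. The following is a minimal Python sketch, assuming the standard amino acid assignments (translation table 11 shares them with the standard code and differs only in which start codons it permits). The function names `clean` and `translate` are hypothetical, and the sketch handles only unambiguous A/C/G/T bases.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGAATACAA CAGCCTCTCT GAATAAACCA AATCGAAACC TGTTCATTCA AAACCTGCTG"
last_line = "CTATTTTGA"

print(translate(first_line))  # MNTTASLNKPNRNLFIQNLL
print(translate(last_line))   # LF
```

The two printed fragments match the start (MNTTASLNKP NRNLFIQNLL) and the end (LF) of the listed protein sequence; the trailing TGA is the stop codon.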