Gene Slin_3785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3785 
Symbol 
ID8727543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4548221 
End bp4549609 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content49% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003388579 
Protein GI284038649 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.664159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0826493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACG ACCGCGTGAT CATCCTAGTA GTCATTTCTC TTTGTGTATT GTTGGCATCG 
GTAGGGACAA GCATCGCCAA TATTGCCCTT CCGGTACTTG AGCGCTCGTT TTACGCCACA
TTTGAGTCGG TTCAGTGGGT GACGATTGCT TATCTGTTGG CTAGTACGGT TTCCGTCACC
ATTGCTGGTA AGCTGGCCGA TCGGTATGGG CATCGCCGGG TTTTGCTGGC CGGTATCCTG
CTGTTTACAA TTGCTTCATT TTTGAGCGCA TTAGCCTCCA CTATGTCAGT ACTCATCCTA
CTGCGAGCCG GTCAGGGTAT GGGAGCCGCT GTTCTGATGA CAAGTGGGAT CACCCTGATA
AAAAAAAATA ATGCGATTCT GAAAACAGGC AGCGCAATGG GGTTAATCGG CACCATGTCG
GCTATTGGTA CCGCGCTGGG GCCTTCGGTT GGCGGCCTGT TGCTAACCAT CTGGGGCTGG
CCCGCTATTT TCTTATTCCT TTCGTTGCTG GGTACACTGG TTTTTTTTCT GGTCATAACC
TATATCCACA AAGACAACCT GACTTCTAAA AGCCGCCAAC CGATCGATGG TTTATCGGTC
GCAGCCCTTA CACTATCGAT CACCGCTTAT GCTCTTTCCA TGACCTTGGG TAAAAAAGGA
GTTGATTGGC TAAACATCCT GTTGATTCTG GTCTCGTTGT TTTCTGGTGG GTTGTTCGTC
TATCATCAAA CCCGCAGTAA CAACCCGTTG CTTCCCGTAA AAACCTTTAA AAATCGTGTG
GTAAGCCGTT CACTGGTCGC CAATTTTGTC GTTTCCAGTA TTATGATGAC AACCCTGGTG
GTTGGCCCTT TTCTACTGAC CATTGGTCTG GGACTAGACG AATTTAATGC GGGGCTAGTG
ATGTCGGTCG GTCCTGTCAT TTCGATTCTG ACGGGCATAC CAGCTGGAAA ACTGGTTGAT
ACACAGGGAC CCGATCGAAT CCTAAAAATA GGGCTACTCA GTCTGTTAAT GGGTACCTTG
GCATTAGCCC TTTTACCGGC TGCCTGGGGC TTGGTGGGCT ATTTACTTGG CATTTCGTTG
CTTACCCCAG GCTACCAGTT CTTTCTGGCT GGTAACAACA CTGCTGTCAT GTCCCAAGCC
AGTAGCCACC AAGAAGGTAT GATAGCGGGT ATCCTCAATT TATCCCGCAA TCTGGGACTG
ATTACCGGCT CTTCCGTGAT GGGTGCTTTG TTTTCGGTTT CCGTAGCCGT TCAGCCGATC
CGGGAAGCGA AGCACGCGGA ACTATTTTTC GGGGTAAGGA TCACGTTTGG GGTGGGTGTA
CTTTTATTGG TGCTGGTGGG TATAACCTCC TGGATAAATT CCTCTTCAGC TGATGTGTCT
CATCAATGA
 
Protein sequence
MKNDRVIILV VISLCVLLAS VGTSIANIAL PVLERSFYAT FESVQWVTIA YLLASTVSVT 
IAGKLADRYG HRRVLLAGIL LFTIASFLSA LASTMSVLIL LRAGQGMGAA VLMTSGITLI
KKNNAILKTG SAMGLIGTMS AIGTALGPSV GGLLLTIWGW PAIFLFLSLL GTLVFFLVIT
YIHKDNLTSK SRQPIDGLSV AALTLSITAY ALSMTLGKKG VDWLNILLIL VSLFSGGLFV
YHQTRSNNPL LPVKTFKNRV VSRSLVANFV VSSIMMTTLV VGPFLLTIGL GLDEFNAGLV
MSVGPVISIL TGIPAGKLVD TQGPDRILKI GLLSLLMGTL ALALLPAAWG LVGYLLGISL
LTPGYQFFLA GNNTAVMSQA SSHQEGMIAG ILNLSRNLGL ITGSSVMGAL FSVSVAVQPI
REAKHAELFF GVRITFGVGV LLLVLVGITS WINSSSADVS HQ