Gene Slin_5749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5749 
Symbol 
ID8729524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6991839 
End bp6993149 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content51% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003390513 
Protein GI284040583 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAT TTCCAACCAG CTACCGATGG CAGATGACCG CCTTGCTTTT TGTGGCCACC 
ACCATCAACT ACCTGGATCG GCAGGTTATC AGCCTGTTAA AACCAACCCT GGAAAAAGTC
TTTGTCTGGA CAGAAACGGA CTACAGCCAT GTCGTTATGG CGTTTCAAGG CGCTTATGCC
CTCAGTTTCA TTGGCTTTGG CTGGGCAATT GACCGGATTG GCGCAAAAAT CGGGTACACT
CTATCCGTCT TTGTCTGGAG TATCGGGGCC ATGCTTCACG CTGTAGCCAG CAGTACGTTT
GGTTTCGGCG TTTTCAGAGC CCTGCTGGGG CTGGGCGAAG GAGGAAATTT TCCGGCCTCT
ATTAAAACGG TTACCGAATG GTTTCCGCAG AAAGACCGGG CGTTTGCTAC AGGTATTTTT
AACTCAGGAA CCAACATCGG CGCCGTTGTT GCCCCGCTCA TGGTTCCCTG GATTCTGGGG
CAGTATGGTT GGCAAGCGGC TTTTCTGGCT ACGGGAGCCA TTGGTTTTAT CTGGCTGATC
TTCTGGTGGA TGTATTATGA AAAACCCAGC CTTCACAAAA AAATCTCGCC CGAAGAACTG
GATTACGTTA GCGAAGCCGT CCCAACGAAT GTCGCTGAGC CTGCGCTGAA GTGGCTCGAC
TTGCTGAACC GTCGGCAGAC CTGGGCTTTC ATCCTGGGAA AACTTCTAAC CGATCCGGTT
TGGTGGTTTT TCCTGTTCTG GTTGCCTTCC TTTTTGGCGT CTACCTTCCA GATCAATCTA
ACCAAACCCA GCTTACCCCT CATCCTGATT TATTCGTCTG CCACCATTGG TAGCGTGGGG
GGTGGTTATA TCTCCGGTTA TTTTCTGGGG AAAGGCTGGT CGGTCAACTG GGCACGCAAA
GCGTCGATGC TGGTCTTTGC GTTGTGCATT GTTCCTATTA TGGCTATTCA GTTTGCCTCT
GAACTCTGGC AGGTAATTGC GTTACTGGGG CTGGCCGTGG CAGCACACCA GGCCTGGAGC
GCCAATCTGT TTACCCTTGT GTCGGACCTG TTTCCGCGAA AAGCCGTAAG TTCAGTTGTC
GGGATAGGTG GCATGGCTGG CTCCATTGGT GGTTTACTGT TTCCGCTACT GGTCGGTTCG
CTGCTGGATT CCTACAAGGC AGCGGGCAGT ATCGCAACCG GCTACAATAT TCTTTTTGTA
CTGTGTGGTA GCGCTTACCT CATCGCTTTG CTAAGTATTC ATTTACTGGT TCCGGCCATG
AAACCAGTAT CGATTAATTT AGGCGACCCA AAACGTACTC CCGAAAAATA G
 
Protein sequence
MPKFPTSYRW QMTALLFVAT TINYLDRQVI SLLKPTLEKV FVWTETDYSH VVMAFQGAYA 
LSFIGFGWAI DRIGAKIGYT LSVFVWSIGA MLHAVASSTF GFGVFRALLG LGEGGNFPAS
IKTVTEWFPQ KDRAFATGIF NSGTNIGAVV APLMVPWILG QYGWQAAFLA TGAIGFIWLI
FWWMYYEKPS LHKKISPEEL DYVSEAVPTN VAEPALKWLD LLNRRQTWAF ILGKLLTDPV
WWFFLFWLPS FLASTFQINL TKPSLPLILI YSSATIGSVG GGYISGYFLG KGWSVNWARK
ASMLVFALCI VPIMAIQFAS ELWQVIALLG LAVAAHQAWS ANLFTLVSDL FPRKAVSSVV
GIGGMAGSIG GLLFPLLVGS LLDSYKAAGS IATGYNILFV LCGSAYLIAL LSIHLLVPAM
KPVSINLGDP KRTPEK