Gene Slin_1244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1244 
Symbol 
ID8724977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1517895 
End bp1519214 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content56% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003386093 
Protein GI284036163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.477377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0341715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAC CGCTGGGAAA TTATCGCTGG ACCATCGTCG CCCTGCTTTT CTTCGCGACG 
ACCATCAACT ACCTCGACCG GCAGGTAGTA GGCTTGCTGA AGCCCACGCT GGAAAAAGAG
TTTAACTGGT CGGAGTTGGA CTACAGTCGG ATCGTGCAGG TCTTTTCGGC GGCTTACGCC
ATTGGTCTGC TCGTTTTCGG CCGGTTCATC GACCGGATCG GCACCAAAAC AGGCTATTCC
ATCGCTATCA TCTTCTGGAG TATCGCGGCT ATGGCCCACG CCCTGGCAAC GAGTACCATG
GGATTTATCT TCGCCCGGAT CGGCCTTGGC CTGGGCGAGG CCGGTAACTT TCCGGCAGCT
ATCAAAACGG TGGCAGAGTG GTTCCCCAAG AAAGAACGGG CGCTGGCAAC GGGCATTTTC
AACTCCGGGG CTAATATCGG TGCCGTTGTT GCGCCTATTC TGGTTCCCTG GATATTGGGC
ATTTACGGCT GGCAAATGGC CTTTATTGTA ACCGGAGCCG TTGGCTTCAT CTGGCTGGTT
TTCTGGTACG TTAGCTACGA AATACCTGCC AAACAGGCCA AACTCAGCAA AGAAGAATTC
GACTACATCC ACAGCGACAA CGAAACCACC CCCGACGACA TTGCCGACCA CGGCAAGCCT
GTTTCCTGGG GTAAGTTGCT GAGTGTTCGC CAGACATGGG CGTTCGTCTT CGGAAAAATG
CTCACCGACC CGATCTGGTG GTTTTTCCTC TTCTGGCTAC AGGATTATTT CTCCACCACC
TTCCACCTCG ACACCAAAAA GCCGAACCTG TATCTGGCCG TACTCTACAC GCTGGTCAGC
ATCGGCAGTA TTGGCGGGGG ATACCTGTCG TCGGCGCTGA TTGGCCGGGG ATGGAGTGTC
TGGAAAGCCC GTAAAACGTC CATGTTCATT TTTGCGCTGC TGGTTATTCC GGTTATCGCC
GTCCGTTTCG GGCCGGGTAT CTGGACAACC GTTGCCCTGA TTGGCCTGGC TGGTGCGGCT
CACCAGGCAT GGAGTGCCAA TATTTTCACC ACGGCGTCGG ATATGTTTCC GAAACGGGCG
GTTAGCTCCA TAGTGGGCAT TGGCAGTATG GCGGGATCGG TCGGCGGCAT CATCTTCCCC
GAAATTGTTG GCCGTATCCT GGACAGCTAT AAACAGGCGG GCGATGTGCA GAGCGGCTAC
GGCATCATTT TCCTGATGTG CGGTTCGGCC TACATGCTGG CCTGGCTGGT GATGCACCTG
CTGGCCCCGA AAATGCAACC CGTTCAACTG GGACTGGCCG AGACCGAGCC GGTATCGTAA
 
Protein sequence
MNKPLGNYRW TIVALLFFAT TINYLDRQVV GLLKPTLEKE FNWSELDYSR IVQVFSAAYA 
IGLLVFGRFI DRIGTKTGYS IAIIFWSIAA MAHALATSTM GFIFARIGLG LGEAGNFPAA
IKTVAEWFPK KERALATGIF NSGANIGAVV APILVPWILG IYGWQMAFIV TGAVGFIWLV
FWYVSYEIPA KQAKLSKEEF DYIHSDNETT PDDIADHGKP VSWGKLLSVR QTWAFVFGKM
LTDPIWWFFL FWLQDYFSTT FHLDTKKPNL YLAVLYTLVS IGSIGGGYLS SALIGRGWSV
WKARKTSMFI FALLVIPVIA VRFGPGIWTT VALIGLAGAA HQAWSANIFT TASDMFPKRA
VSSIVGIGSM AGSVGGIIFP EIVGRILDSY KQAGDVQSGY GIIFLMCGSA YMLAWLVMHL
LAPKMQPVQL GLAETEPVS