Gene Slin_0372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0372 
Symbol 
ID8724100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp472560 
End bp473810 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content52% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003385235 
Protein GI284035305 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.862207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCAAC AGTCTCCAGT CGATACGTAC GCATCCCTTC GAATTCCCGA ATTTCGCTAT 
TTCGTCATGA ACAGCTTTCT GATTACAGCT ACCCTGCTGA TTCAGGAGGT TATTCTGGGT
TACGAGCTTT ATAAAATCAC GCACGATCCG CTGATGCTGG GTCTGGTTGG ACTGGCTGAA
GCGATTCCGT TCATTGCGCT GTCGCTTTTT GGCGGTCACC TGGCCGACCG GCGCGATAAG
AAGCGGATTC TGCAATGGAG TTTGCTGGTC ATCCTGATTG GGTCGGTTAT TCTGTATCTG
GTCTTTCAAC CGGCGTTTGC TGCCGGGTTG ACACAAACGG CCCGTTTAGG AACCATCTAT
GGGGTACTGA TGCTGATTGG CACTGCTAAA GGGTTTTACT CGCCGGCCAG CTCGTCGCTC
AAGCCATTCT TAGTGCCTCG TGAACTTTAC GCAAATTCGG CCACCTGGAG TAGTTCGTTC
TGGCAGGCGG GCGCCATTAT AGGGCCGGGT TTGGCGGGTT TTTTATACAG CTGGGTCGGT
TTCGACAATA CCCTGATTGT GGTTATTGCC CTGCTACTGT TCTGTTTTGT CTTGATTTCG
CTCATTGAGC GAAAACCAAC ACCCGTTACA GATTCGCCCG TATTGAAACT CAGCGAAAGT
TTGAAAGAGG GCTTCCGGTT TGTGTTCAAG ACCCAAATTG TTCTCTACGC CATTTCTCTC
GATCTGTTTT CGGTACTATT TGGGGGGGTA GTGGCTATTC TGCCGGTCTT CGCCGAAGAT
ATTCTGAAAG TAGGTGCCGA AGGGCTGGGT TTTTTGCGAG CTGCACCGTC GGTAGGAGCC
TTACTGACAA TGGCCTACAT GACCAAACAC CCACCTACGC ATAATGCGTG GCGCAATATG
TTGTTGTCGG TAGCCGGGTT CGGCGTGGCT ACGATCATCT TCTCGCTGTC AACCAATTTT
TACTTATCCC TCATCATGCT CGGCCTGACG GGCGCTTTTG ATAGTGTGAG CGTCATTATC
CGTCAGACGA TCCTGCAAAT TTTCCCGCCC GATCACATGC GGGGACGGGT GGCTGCAGTA
AACGGCATCT TTGTCAGTTC ATCGAACGAA ATAGGGGCGT TTGAATCCGG CTTACTGGCC
CGTTTGCTGG GTACGGTACC ATCGGTTCTG CTGGGTGGCG TTGTTACGCT GCTGGTTGTT
ACCTACGTGT ACGCCAAATC GAAAGCCCTG CTGGCCGTGC GCTTAAGCTA G
 
Protein sequence
MVQQSPVDTY ASLRIPEFRY FVMNSFLITA TLLIQEVILG YELYKITHDP LMLGLVGLAE 
AIPFIALSLF GGHLADRRDK KRILQWSLLV ILIGSVILYL VFQPAFAAGL TQTARLGTIY
GVLMLIGTAK GFYSPASSSL KPFLVPRELY ANSATWSSSF WQAGAIIGPG LAGFLYSWVG
FDNTLIVVIA LLLFCFVLIS LIERKPTPVT DSPVLKLSES LKEGFRFVFK TQIVLYAISL
DLFSVLFGGV VAILPVFAED ILKVGAEGLG FLRAAPSVGA LLTMAYMTKH PPTHNAWRNM
LLSVAGFGVA TIIFSLSTNF YLSLIMLGLT GAFDSVSVII RQTILQIFPP DHMRGRVAAV
NGIFVSSSNE IGAFESGLLA RLLGTVPSVL LGGVVTLLVV TYVYAKSKAL LAVRLS