Gene Slin_2251 details


Gene Information

Locus tag: Slin_2251
Symbol:
ID: 8725991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2722350
End bp: 2723669
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003387071
Protein GI: 284037141
COG category:
COG ID:
TIGRFAM ID:
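
The coordinate and length fields above are related by simple arithmetic, which can serve as a consistency check on the record. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2722350, 2723669)
print(length_bp)                  # 1320, matching "Gene Length"
print(protein_length(length_bp))  # 439, matching "Protein Length"
```

Under these assumptions the reported gene length (1320 bp) and protein length (439 aa) agree with the start and end coordinates.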


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAAAATA CCCCCCGTAC ACTAACTGCC TGGACATACT ATGACTGGGC CAACTCGGTT 
CATTCGCTGG TAATTGTATC GAGTATCTTC CCGGTTTATT TTTCGGCTAC TGCCCTTAAC
GAAACCGGCG GTCCGATTAT CAGCTTTCTG GGATTTTCAC TCAAGAACTC GGTACTCTTT
TCCTACACCA TTTCGGCGGC TTTTCTGTTT ACCGCCCTGC TCTCTCCCAT CTGCTCGGCC
ATTGCCGATT ATAGCGGCCA CAAAAAAGCC TTCATGAAGT TCTTCTGCTA TCTCGGGGCC
ATCAGTTGCA GTCTGCTTTA TTTTTTCACC CGGGAAACCA CCACCTTTTC GGTGATCTGT
TTCTGGCTGA GCCTTATTGG CTGGAGCGGG AGCATTGTCT TTTACAACTC CTACCTGCCC
GACATTGCGA CCGAAGACCA GTACGACCGG GTGAGCGCCC GTGGCTTTTC TATGGGGTAC
ATTGGCAGTG TGCTGCTCAT GATCATCAAC CTGGTCGTCA TTCTGAAACG GGAGTGGTTC
GGCAACATTT CCGAAGCGTT GGCCTCCCGT ATCGCCTTCC TGACCGTGGG CCTGTGGTGG
GTCGGCTTTG CCCAGATTCC GTTCAGCCGA TTGCCGGAAG GCGTCAAAAA AGTGACGCCC
CAAACTCAAA AAAGCAACTA TTTACTAAAC GGCTTTAAGG AGTTGCAGCA GGTGTGGGCC
GACTTACAGC AGCGTCCGCT GGCCAAGCGA TTTCTGGTTT CCTTTTTTGT GTATAACATG
GCGGTGCAAA CGGTCATGTA CGTGGCCACC ATATTTGGCA GCGATGAGCT GAAACTACCG
GGACAAAGTC TGATCATTAC GGTGTTGCTG ATTCAACTGG TCGCTATTCC GGGCGCTTTT
GGTTTTTCCC GACTTTCCGA ATGGCTCGGC AACACCTATG CGCTCATGGT GGCGGTTGTC
ATCTGGATCG GCATCTGTGC GGGTGCATAT TATGTGCAAA CGCAATCGCA GTTTTTTATG
CTGGCGGCCA TTGTCGGGCT GGTTATGGGC GGTATTCAGT CGCTCTCGCG GTCCACGTAC
TCCAAACTCA TTCCGGCTAC TACCGACACC GCGTCCTTTT TCAGCTTTTA CGACGTAACT
GAGAAGCTGT CCATCGTATT GGGCACGCTC GTTTACGGAC TTATCGAACA AATTACGGGG
AGTATGCGCA ATTCGGTACT CGCTCTGTTA GTGCTGTTTG TGATTGGTTT CCTGCTTTTA
TGGCGAATTC CTTCACAAAA AGTTTACCAT ACCCATTTGG AGAACGCAGA AGTTTTGTAA
 
Protein sequence
MKNTPRTLTA WTYYDWANSV HSLVIVSSIF PVYFSATALN ETGGPIISFL GFSLKNSVLF 
SYTISAAFLF TALLSPICSA IADYSGHKKA FMKFFCYLGA ISCSLLYFFT RETTTFSVIC
FWLSLIGWSG SIVFYNSYLP DIATEDQYDR VSARGFSMGY IGSVLLMIIN LVVILKREWF
GNISEALASR IAFLTVGLWW VGFAQIPFSR LPEGVKKVTP QTQKSNYLLN GFKELQQVWA
DLQQRPLAKR FLVSFFVYNM AVQTVMYVAT IFGSDELKLP GQSLIITVLL IQLVAIPGAF
GFSRLSEWLG NTYALMVAVV IWIGICAGAY YVQTQSQFFM LAAIVGLVMG GIQSLSRSTY
SKLIPATTDT ASFFSFYDVT EKLSIVLGTL VYGLIEQITG SMRNSVLALL VLFVIGFLLL
WRIPSQKVYH THLENAEVL
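
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below translates only the first 30 and the last 60 nucleotides of the gene sequence above, as an illustration. It uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in the start codons it permits). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_30 = "ATGAAAAATA CCCCCCGTAC ACTAACTGCC"
last_60 = "TGGCGAATTC CTTCACAAAA AGTTTACCAT ACCCATTTGG AGAACGCAGA AGTTTTGTAA"

print(translate(first_30))  # MKNTPRTLTA
print(translate(last_60))   # WRIPSQKVYHTHLENAEVL*
```

Both fragments match the start and end of the listed protein sequence, with the final TAA codon translating to the stop marker.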