Gene Slin_3956 details

Gene Information

Locus tag: Slin_3956
Symbol:
ID: 8727714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4747230
End bp: 4748579
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003388745
Protein GI: 284038815
COG category:
COG ID:
TIGRFAM ID:
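
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4747230
end_bp = 4748579
reported_gene_length_bp = 1350
reported_protein_length_aa = 449

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon that does not contribute a residue to the protein.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
expected_protein_length_aa = codon_count - 1

print(gene_length_bp == reported_gene_length_bp)
print(leftover_bases == 0)
print(expected_protein_length_aa == reported_protein_length_aa)
```

Under these assumptions all three checks agree with the reported values: 1350 bp corresponds to 450 codons, of which 449 encode residues.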


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.621866
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.612872
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCGC CAGCTTCTAT ACCGCGCCAA CCTAACACTT TAACCCCTAC CCCTGTGACT 
ACTTCCAAAA CCCATAATTA CCGGTGGATT ATCGTCGTCC TGTTATTCAC CGCAACCACA
ATCAACTACC TCGACCGGCA GATCATCGGT CTGCTGAAAC CCATTCTGGA AGTTGAATTT
TCCTGGACCG AAACACAGTT TGCCAACATC GTGATCGCCT TCACGGCAGC CTATGCCGTT
GGTCTTTTAG TGTTCGGCTG GTTTATCGAT AAGGTTGGCA CAAAAATAGG ATACGCCGTC
ACAATCGTCT GGTGGAGCGT GGCCGGTATG CTGCACGCGC TGGCCCGCAG TGCGTTTGGT
TTTGGTTTGG CCCGCGTAGG GTTAGGGCTG GGCGAAGCAG GGAATTATCC AGCCGCCGTG
AAGACCGTTG CCGAGTGGTT TCCCCAGAAA GAGCGTGCCC TGGCAACGGG TCTATTTAAC
GCCGGTACAA GCATTGGCGT GGTGGCGGCC CTGCTCATTG TACCCTGGAT TTTGAGCCAT
TACGGCTGGC AGGAAGTGTT CTGGATTACG GGTGCTATGG GCTTTGTCTG GCTCATTTTC
TGGCTCATTT TCTATGAAGT TCCAGCGCGG CAGCAACGGC TGTCGGCTGA GGAGTATGAT
TACATTACCA GTGGTCAGGA AGCAGAAACC GAAAAGAAGC TGCCTATCAA GTGGTTCAGG
TTGTTTACCC TTCCGCAGAC CTGGGCGCTC ATCACCGGCA AAGGACTTAT CGACCCCATT
TACTGGTTTT TCCTGTTCTG GCTCCCTTCC TATTTCTCGT CGACATTTAA GCTGGACCTT
AAGAAACCCA GTCTGGAGCT GATGCTCATT TATCTGGCCA CCACCGTGGG CAGCATTGGT
GGCGGGTATT TATCGTCATG GCTCATCAAA CGCGGCTGGC CGACGCTGAA AGCCCGAAAA
ACCGTTTTGA TCATATTTGC CGGGTTGGAA TTATCCATTA TCCTTGCTCA ATTTGCGACG
GATGTCTGGG TAGCGGTGGG GCTGATCAGT CTGGCGGTGG CCGTTCACCA GGCCTGGGCG
ACCAACGTGT TTACGATGGC GTCTGATTTA TTCCCTAAAC AAGCGGTCAG TTCGGCCGTT
GGTATAGCGG GGATGGCCGG AGCCGTGGGC GGAATTTTCT TCCCGATGCT AGTCGGTCGT
TTGCTGGACA CCTACAAAGC GGCTGGCAAT CTGTCGGGGG GTTACAACGT ATTATTCACC
ATCTGCGGAT TTACGTACCT GACAGCCTGG GTTATCATCC ACCTGCTTAC CCGAAAGCCT
AAACCAGTCG ATATAAGCCA ACTGACGTAA
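
A GC content figure such as the one reported in this record can in principle be recomputed from the gene sequence. The sketch below is a hedged illustration rather than the database's own method: it assumes GC content is the share of G and C bases among all bases, and that the spaces and line breaks in the listing are formatting only. The function name `gc_percent` is hypothetical. Only the first row of the sequence is used here, so the result describes that row and not the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in this
    listing) is ignored; letter case is ignored as well.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, copied with its grouping intact.
first_row = "ATGCCCGCGC CAGCTTCTAT ACCGCGCCAA CCTAACACTT TAACCCCTAC CCCTGTGACT"

print(f"GC content of the first 60 bases: {gc_percent(first_row):.1f}%")
```

Passing the full 1350-base sequence to the same function would give a value to compare with the reported GC content, subject to how the database rounds.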
 
Protein sequence
MPAPASIPRQ PNTLTPTPVT TSKTHNYRWI IVVLLFTATT INYLDRQIIG LLKPILEVEF 
SWTETQFANI VIAFTAAYAV GLLVFGWFID KVGTKIGYAV TIVWWSVAGM LHALARSAFG
FGLARVGLGL GEAGNYPAAV KTVAEWFPQK ERALATGLFN AGTSIGVVAA LLIVPWILSH
YGWQEVFWIT GAMGFVWLIF WLIFYEVPAR QQRLSAEEYD YITSGQEAET EKKLPIKWFR
LFTLPQTWAL ITGKGLIDPI YWFFLFWLPS YFSSTFKLDL KKPSLELMLI YLATTVGSIG
GGYLSSWLIK RGWPTLKARK TVLIIFAGLE LSIILAQFAT DVWVAVGLIS LAVAVHQAWA
TNVFTMASDL FPKQAVSSAV GIAGMAGAVG GIFFPMLVGR LLDTYKAAGN LSGGYNVLFT
ICGFTYLTAW VIIHLLTRKP KPVDISQLT