Gene Slin_5402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5402 
Symbol 
ID8729169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6570892 
End bp6572187 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content55% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003390168 
Protein GI284040238 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACGA ACAAACCTCT CGAATCGCTG AACGTTCTCA GTTTCAAAGG TGTGCAGATG 
CGGACCTTCC ACATTACCTG GCTCACTTTC TTTGTGTGTT TCTTCGGCTG GTTTGGGCTG
GCTCCGCTGA TGCCCGCCAT CCGGGCCGAT CTGGGGCTGA CCAAACCGCA GGTCGGCAAT
ACAATCATTG CCGCCGTGTC AGCTACCATT CTGGCCCGTC TGGTCATTGG CCGCCTGTGC
GACACCTGGG GACCACGTAA GACCTACACG GCGTTGCTGT TGTTGGGCTC GCTGCCCGTT
ATGTTCGTTG GTCTGGCACA TGATTACACC ACCTTTCTGC TATTCCGACT GGCGATTGGC
GTCATTGGCG CGTCGTTCGT GATCACGCAG TTTCATACAT CCATGATGTT TGCCCCGAAG
ATCAAAGGTA CGGCCAACGC TGTAGCCGGT GGCTGGGGCA ACCTCGGTGG CGGTATTACT
CAACTGGCGA TGCCGCTCAT CATGGCGACC ATCGTTGGTT TCGGCTACAC CAAGCCCGAA
GCGTGGCGGC TGGCCATGAT CGTACCGGGT GTGCTAATGC TTGTGATGGC CTTCATTTAC
TACCGCTTTA CCCAGGATAC ACCCGCTGGC AATTATGGTG ATATAGAGAG GTCGATTACC
AAGGGCGAGA AAGTAAGTTT CTGGGCGGCC TGCGCCGATA TCCGCGTATG GGCGCTGGCA
CTGGCTTATG CCTGCTGCTT CGGGATGGAG ATTACGTTCG ATGGCGTGGC CGCGCTTTAT
TTCTTCGATA ACTTCAAGCT GGCCGAAACG GAAGCCGGTT TCTGGGCCAT GCTTTTCGGT
GGCATGAACA TTTTTGCCCG TGCGCTCGGC GGTATCGTGG CCGATAAAGT CGGGAATAAA
TATGGTATGC GGGGCAAAGG TATTCTGCTG GCGGGTATGC TGGTGCTGGA AGGAGCGGGC
ATTATGCTTT TCGCGCAGGC CGGTAGCCTG CCAATGGCTA TCATGTCCAT GATCACCTTC
GCGTTGTTCC TGAAAATGTC GAATGGCGGT ACGTATGCCA TTGTGCCGTT CGTGAACCCC
AAAGCTGTGG GCGTCATTTC GGGCGTAGTT GGTGCAGGCG GAAACCTCGG CGGTATGCTG
ATGGCCTTCC TCTTCAAATC GCAGTCCATC ACCTACGGAC AGGCGTTTCT ATACATCGGC
TGCGCCGTAG CCGCCATTGG ATTGCTCTTA TTCCTGGTCA ATTTCAGCAA AGAGACCGTC
GCCGAATCAG CGGAAGTGGA ATTGCAAACA GCTTGA
 
Protein sequence
MQTNKPLESL NVLSFKGVQM RTFHITWLTF FVCFFGWFGL APLMPAIRAD LGLTKPQVGN 
TIIAAVSATI LARLVIGRLC DTWGPRKTYT ALLLLGSLPV MFVGLAHDYT TFLLFRLAIG
VIGASFVITQ FHTSMMFAPK IKGTANAVAG GWGNLGGGIT QLAMPLIMAT IVGFGYTKPE
AWRLAMIVPG VLMLVMAFIY YRFTQDTPAG NYGDIERSIT KGEKVSFWAA CADIRVWALA
LAYACCFGME ITFDGVAALY FFDNFKLAET EAGFWAMLFG GMNIFARALG GIVADKVGNK
YGMRGKGILL AGMLVLEGAG IMLFAQAGSL PMAIMSMITF ALFLKMSNGG TYAIVPFVNP
KAVGVISGVV GAGGNLGGML MAFLFKSQSI TYGQAFLYIG CAVAAIGLLL FLVNFSKETV
AESAEVELQT A