Gene Slin_5223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5223 
Symbol 
ID8728989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6374824 
End bp6376068 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content52% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003389994 
Protein GI284040064 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAAA ACAGACAGTT ACTGCTTATT TTTTCCATTG TTCTGATCGA TGTTATTGCA 
GGTAGTGGAT TGGGCGTGCT GATTTCAAAC TATGTATTGA ACTTGCCGGC TAAGCCTGTG
CTTATGACGG TTGGCACCGC CCTCATGCTG GGCGTTCAAC TGGCGGGTTC CCCGGCCATT
GGTCGCTGGT CTGATTTGCG GGGCCGTCGA CCGGCAGCCA TTGCTACCAC GGTGGTATCG
CTGCTCTCAT CCCTGTTCTT ACTACCCGTT CAAACCTGGG GATACGTTGC CAGCCGGTGG
GTGAAGGGTG GGTCTAATGG GCTGTACTCT GTTATGCGGT CGGCGGTGGC TGACCTGACC
GATAAAGATG AACTGCTTAA ATATGGCGGA CTGTTTAGTT TCATTGCCGG TTCGGCACCG
GTTATTGGCC CTATGGGTGC CGGGTGGCTC ATGCTGGTAG TTCACGAAGC CCGGATCAAT
CCATTACCTA CGGTTCTGCT CCTGTTGGCA CTAGGTTTGC TGAACATCAG ACTAGCAATG
CTTTTCCGGG AAACCAATCC TAAAAAAGAA GCTGTTGACT ACACCGAACT GGCCGATAAA
GCGCGTAACT CGCTGAAAGT CGTTTCTATC TGGCGGCAAC TCCTTGAGGC CGATAAACAG
CTTCCGGGTA TAAAATCTAT TCTGATTCTA AACTTGCTGG CTACCCTCGG TATGGGGTAT
TTCGCTTTTT TTGTGGCCTT CCTGACCCAG AGTGAGCTTA TTATGACGCC CGCCGAAACA
GCCCGGTTTT TCCTGTACTA CGGTGGTCTG GCGTTAGCCG CCAACTTTAT TTTCTTTACC
TACATCGTTC AGCACGTCAA TAAGCGCATC GCGATTCTGG TTATGGCGTT GATCAGCATT
GTTTTGCAGG TAGTGTATAC CTTTTCAGAA TCGTCGGTCG AGCTGTTTTA TGTAGTGGCC
GGTGTCGACG CGCTCACCGT TTCCATTATT ACCGGCCTAA CCGGGGGCAT CCTATCGCAG
GTGATCAAGG AAGGCAGCGG ACAGGGCGAA ATGTTCGGTA ATATACAGGC GCTGGGCGGG
CTGGCCAGTT TTGCAACGGC ATTGGTAAAC AGTCTGCTCT CGGGAGTAAG CCTGAAAGCC
CCGTTTATTT TCTGTGCTAT CAGCATGATT GCCGTCGTTA TCTGGACAAT ACGCCTGCCG
AAAGCAGCCA GGCAACATAC CGACTCGAAA ACGCCTGCGA CCTGA
 
Protein sequence
MSKNRQLLLI FSIVLIDVIA GSGLGVLISN YVLNLPAKPV LMTVGTALML GVQLAGSPAI 
GRWSDLRGRR PAAIATTVVS LLSSLFLLPV QTWGYVASRW VKGGSNGLYS VMRSAVADLT
DKDELLKYGG LFSFIAGSAP VIGPMGAGWL MLVVHEARIN PLPTVLLLLA LGLLNIRLAM
LFRETNPKKE AVDYTELADK ARNSLKVVSI WRQLLEADKQ LPGIKSILIL NLLATLGMGY
FAFFVAFLTQ SELIMTPAET ARFFLYYGGL ALAANFIFFT YIVQHVNKRI AILVMALISI
VLQVVYTFSE SSVELFYVVA GVDALTVSII TGLTGGILSQ VIKEGSGQGE MFGNIQALGG
LASFATALVN SLLSGVSLKA PFIFCAISMI AVVIWTIRLP KAARQHTDSK TPAT