Gene Slin_3621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3621 
Symbol 
ID8727374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4376768 
End bp4378558 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content50% 
IMG OID 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003388427 
Protein GI284038497 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.434287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCAAT CTCTAAGTAC CGTCGATCTG GTCGTAGTCG TTTTGCTATT GCTGTTTCTG 
GTGACGGCCA GTATGTATTC CAGCTTTAAA AAGAAAAGCT CGGAAGAGTA CTTCATGGCG
GGACGGTCGC TGAAATGGTA CTCCGTAGCG GGGTCAATTT TTGGTACGAA TATCCACGCC
CAGCAGATTA TTGGCATGAT GGGCGTAGGC TATTCCATTG GCTTTGTCCA GAGTCATTAC
GAAGTATGGG CGGTGCCAGC CATTCTGGTA CTGGTGTATA TTTTCATTCC TATTTACCGA
AAACGGCAGT TTTTTACCCT TTCCCAATTT CTGGAAAACC GTTACAGCGG CCAGACGCGC
CTGGTTTATA CCATCCTGAT GATAGCCTTC ATCATGATTC AGTTGATTGG CGGTTTCTAC
ATCGGCAGCC GTACGCTGGG CATTCTGTTT GAAGGCACCA GTTGGGAACT GACCTATTTA
CAAGGCATTC TAATCATTGC AACGGTTACC ATTCTGTTCA CTGTTTTCGG AGGGATGGAA
TCGGTTGTAA TTGCGGATAA TATCCTGACC GTTGTGATGA TCGTGTCGGT GTTGCTGATG
GGTACGCTGA CCTACCTACA ACCCGAAATA GGCGGAATCA GCGGTCTGCT GAAGCTGGAC
CATGCCGAGG CCAATAAAAT GCATTTGTAC CTGCCAGCCA GTCACCCCAA GCTTCCCTGG
CTGGGCATCT TTACGGGCCT CACCATTCTT AACTTCTTCT ACTGGACAAC CAACCAGTAT
CAGGTACAGC GGGTGCTGGC TGCGCAAACC GAACGTGATG CCAAGCTGGG CTCCATTGCC
GCCGGATTTT TGAAACTTAC TATTCCGTTC TTTTCCATCG GAGCGGGAAC GGCGGCCTTT
TATTTGTTCA GAGCCCGATT CGGTGAGAAT AGTATTAAGC CCGACGATAC GTTCCTGACG
CTCCTGAAGA CGGTTGTGCC GGTCGGTTAC GGGTTTGTAG GATTAATACT AGCCGGGTTG
ATGTGCGCTA TATTTTCGGC GATCTACTCG ATGATGAACT CCGTTTCGAC CATGCTGGCG
TATGATGTGT ACCGCAAATA CCTCAAGCCT AATGCGTCGG ATAAGCTGAC TGTCCGGTTC
GGGCAGGGGG GCGTTTTTGT GATGTGTGCC ATTGCTACCG GCCTTGCCTA TACCACCTTC
GACCCGACCT CGTCCGAGAA TTTTTTCCTG ATTCTGGCTA ACCAGACGTC TTACCTGAAA
CCCGGTTTGG TGGTGGTGTT CTTTTGGGGT GTACTCTGGC AAAAAACCAA CCCGAAAGCG
GCCGTTATTG TGCTTATGAG TTCCCCACTG ATCGGTTTTG GCTGCGACTG GCTTTATGAC
CATGTCCTAG TCAACTCACT CTGGGTACGC GATACGTTTG GAGAAACGCT CAATTTCCTG
TACCGCGTGT TCCTGATCTT TCTGATCGGA TCGGTGTTGA TTGCGGTGCT GAGCCTTTAT
TTTAACCGCC GGACCGGTCC CGTACAAACA CACGACATGA CCGTTTCGGT CAATGGCATC
GGGAGTGCTT TGCTGCGATT TGCGGTGCTT CAGGGGCCAT TGCTGGCGCT GGTACTACTT
GATCGGCTAT CGCCCCAGCA AGCAGCTCTG CCATCTGCCA TTCTTACAAT AGCCTTGTTC
GGGTGGTATC TGAAGCGCGA AAAAGAGGCC GTTGCCTTTT ATCAGTCCGA CATTTTTTAC
GCTGGGATAC TTACGGGCGT TATGATGTGG ATTATGTATT ATTTCGCCTG A
 
Protein sequence
MLQSLSTVDL VVVVLLLLFL VTASMYSSFK KKSSEEYFMA GRSLKWYSVA GSIFGTNIHA 
QQIIGMMGVG YSIGFVQSHY EVWAVPAILV LVYIFIPIYR KRQFFTLSQF LENRYSGQTR
LVYTILMIAF IMIQLIGGFY IGSRTLGILF EGTSWELTYL QGILIIATVT ILFTVFGGME
SVVIADNILT VVMIVSVLLM GTLTYLQPEI GGISGLLKLD HAEANKMHLY LPASHPKLPW
LGIFTGLTIL NFFYWTTNQY QVQRVLAAQT ERDAKLGSIA AGFLKLTIPF FSIGAGTAAF
YLFRARFGEN SIKPDDTFLT LLKTVVPVGY GFVGLILAGL MCAIFSAIYS MMNSVSTMLA
YDVYRKYLKP NASDKLTVRF GQGGVFVMCA IATGLAYTTF DPTSSENFFL ILANQTSYLK
PGLVVVFFWG VLWQKTNPKA AVIVLMSSPL IGFGCDWLYD HVLVNSLWVR DTFGETLNFL
YRVFLIFLIG SVLIAVLSLY FNRRTGPVQT HDMTVSVNGI GSALLRFAVL QGPLLALVLL
DRLSPQQAAL PSAILTIALF GWYLKREKEA VAFYQSDIFY AGILTGVMMW IMYYFA