Gene Slin_5026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5026 
Symbol 
ID8728791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6126394 
End bp6128022 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content52% 
IMG OID 
Productsulphate transporter 
Protein accessionYP_003389802 
Protein GI284039872 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0023678 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACAC AAGTTAAACC TATTATTCAC GTCGTTCCGG CGAAAGGTCT TGCCGGGTTG 
AAAGAAAATT GGCAGTCCGA TTTATTATCC GGATTCCTGG TTTCGCTGAT TGCCTTGCCA
CTCAGCCTTG GTATCGCATC AGCCAGTAAT TTTCCGCCAA TCATGGGCGT GCTGACAGCC
ATTGTTGGCG GGTTGATCGT TGCGTTGTTT GCGGGTTCGG AACCAACCAT TAAAGGGCCG
GCGGCCGGGT TGATCGTAAT TGTTGCCGGT TGCGTAGAAG AATTAGGAAA GGGAGACCTC
GAACTAGGCT GGAAACTGAC CCTGGGGGTT ATCATAGCGG CCGGTGTCCT GCAAGTGGTT
ATGGGCATAC TGAAAGTGGC CAAACTAGCC GACTTTTTTC CGCTGTCGGC CGTCCATGGT
ATGCTGGCAG CCATCGGGAT CATCATCATG TCGAAGCAGA TTCACCTGGC CGTTGGTATC
GCCCCATCCG AACTAAAGGG AAAAGAGCCG CTCGAACTGC TGGCAATGGT GCCCCACAGT
CTCAGCCATA TGGAGTGGCA CGTAGCAGTG ATCGGGCTCG TTAGCCTGGT TATTCTGTTT
AGCTGGCCTA ACATCAAAAG CAAAGCGATC AAGCAAATCC CTCCCGCACT GGTTGTTCTG
GTCGTAGCCA TTGCCCTGGG ATTATACTTC AACCTGTCTG ATACAAAACT ATACAGCGCC
ATTAAACCAC TGGTCAATCC CGGCGAGTTT AAGCTGTCGT ACAATGCAAA CTTTGGAGCC
TGGTCGGGCG ACATGCTGCC CGTCGCCCTG AAATACCTCG CCATGTTCAC TATTATTGGC
TCGCTGGAGT CGCTGCTGAC GGGTAAGGCA ATCGACCTGC TTGACCCCTA CAAGCGTAAA
TCGAATTTAA GCAAGGATTT AACAGCGGTT GGTATCGGCA ACATGGTGTC GGCAGCACTG
GGTGGCCTAC CTATGATTTC GGAAGTAGCC CGTTCGTCGG CTAACCTGAC CAATGGGGGC
AAAACTCGCT GGGCAAACTT CTTTCACGGC GGATTCCTGC TGCTTTTTGT AGTGGCTCTT
GTTCCGCTCA TCAAACTTGT TCCGGTAGCG GCACTGGCGG CCATCCTGAT TGCCGTTGGG
TTCCGGTTGG CTGCTCCCAA AGAGTTTCGC CATATGCACC ACATAGGAGC TGAGCAGTTG
ATCGTATTTG TGATTACAAT TATAGCTACG CTGGCTACCG ACTTGCTGGT AGGCATTGCG
GTGGGTATTG CCGCCAAGTT TGTTATCCAG CTGGCACTCG GCCTGCCTAT AAAATACCTT
TTCAACCCCC AGCAGGAGCT TATCTCCGAA GGATCGCACC ATACACTTAC CATCACCGGA
GCCGCTGTGT TTACCAATTA CCTGTCGATC AAAAAGCAAC TGGACACTAT CCCACAGGAG
GCAGGTCAGC ACGTCACAGT CGACTTACAC CAGGCCCGGT TCGTAGACCA TACCGTTATG
GAAAATCTGC ACAATTACGA GCGTGATTTT CAACTGGCCG GTGGCGAATT CCACGTCATT
AACCTCGATG GGCACCAACC CATGTCGACG CATCCGCTGG CGGCCCGTCG AAAAAAAATG
GCCATTTAA
 
Protein sequence
METQVKPIIH VVPAKGLAGL KENWQSDLLS GFLVSLIALP LSLGIASASN FPPIMGVLTA 
IVGGLIVALF AGSEPTIKGP AAGLIVIVAG CVEELGKGDL ELGWKLTLGV IIAAGVLQVV
MGILKVAKLA DFFPLSAVHG MLAAIGIIIM SKQIHLAVGI APSELKGKEP LELLAMVPHS
LSHMEWHVAV IGLVSLVILF SWPNIKSKAI KQIPPALVVL VVAIALGLYF NLSDTKLYSA
IKPLVNPGEF KLSYNANFGA WSGDMLPVAL KYLAMFTIIG SLESLLTGKA IDLLDPYKRK
SNLSKDLTAV GIGNMVSAAL GGLPMISEVA RSSANLTNGG KTRWANFFHG GFLLLFVVAL
VPLIKLVPVA ALAAILIAVG FRLAAPKEFR HMHHIGAEQL IVFVITIIAT LATDLLVGIA
VGIAAKFVIQ LALGLPIKYL FNPQQELISE GSHHTLTITG AAVFTNYLSI KKQLDTIPQE
AGQHVTVDLH QARFVDHTVM ENLHNYERDF QLAGGEFHVI NLDGHQPMST HPLAARRKKM
AI