Gene Slin_3026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3026 
Symbol 
ID8726778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3661958 
End bp3663127 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content51% 
IMG OID 
Productgalactokinase 
Protein accessionYP_003387836 
Protein GI284037906 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTAC TTACCAAGCT TACCGATTCA TTCCAGCAAG CCTTCCCGGC TGAAACTCCC 
GGCCAACAAA CCCCGCTACT GATTTGCTCA CCCGGTCGTG TCAACCTTAT CGGCGAACAT
ACAGATTACA ATGAAGGGTT TGTGTTACCA GCCGCTATCG ACAAAGCGAT TTACCTGGCC
GTTGGTCCCC GTTCTGACAA CGAACTACAT TTTATAGCCC ACGACCTCAA CAAAACATTT
CAGGGTTCTT TAACTGATTT GACACCTACG CACACCTGGG CTGATTACTT GCTCGGCGTT
GTTGCGCAGT TTCGGCAGGC GGGCCATCAA TTTAGTGGTT TCAATTGCGT ATTTGGCGGC
ACCATTCCCA TGGGTTCAGG GCTATCATCG TCGGCAGCGC TCGAAAACGG GGTCGGGTTT
GCGCTCAACG AGCTTTTTCA GCTGGGCATC GACCGGATTG CACTGGTAAG ATTATCACAA
CGCGCCGAAA ATGAGTTTGT AGGAGCGAAA GTGGGCATCA TGGACATGTT TGCCAGCATG
ATGGGTAAAG CCGATCACGT TATTAAACTC GACTGCCGCT CACTCGACTA CACCTACGCT
CCCTTGCAGA TGAACGGGAT CAGCATCGTT CTCTGCGACT CCAAAGTAAA GCACTCGCTG
GTAACATCAG AATACAACAC CCGCCGGGCC GAATGCGAAG CCGGGGTACG ATTTCTGCAA
ACTTTCTATC CCGAAATCAG GAGTTTACGG GATGTAACCA TGCCTATGCT CGACCAGCAT
CTGCGCGATA CGGAGCCCCT GATTTACCGT CGGTGTGCCT ATGTAGTTCA GGAAAATCAG
CGGTTACTCG ATGGTGTAGC GGCTCTGGAA GCGGACGATA TTGACACCTT TGGCCAACTC
ATGTACGGCT CTCACGAGGG GTTAAGTCAC TGGTACGAAG TGAGTTGCCC GGAGCTTGAC
ATCCTGGTGG ACATTGCCCG TGAGCAGCCG GGTGTACTGG GTGCCCGAAT GATGGGGGGC
GGTTTTGGCG GTTGCACAAT CAATCTTGTG CGCGAAGAAG CTCTTGAGGA TTTTACCAAA
TTGATTACCG AACAATATAA AGCCCAAACG GGGAAAGATA CGTACCTTCA CGTCTGCAAA
ATCCAGGACG GGACGAATGT AATCAGCTAA
 
Protein sequence
MDLLTKLTDS FQQAFPAETP GQQTPLLICS PGRVNLIGEH TDYNEGFVLP AAIDKAIYLA 
VGPRSDNELH FIAHDLNKTF QGSLTDLTPT HTWADYLLGV VAQFRQAGHQ FSGFNCVFGG
TIPMGSGLSS SAALENGVGF ALNELFQLGI DRIALVRLSQ RAENEFVGAK VGIMDMFASM
MGKADHVIKL DCRSLDYTYA PLQMNGISIV LCDSKVKHSL VTSEYNTRRA ECEAGVRFLQ
TFYPEIRSLR DVTMPMLDQH LRDTEPLIYR RCAYVVQENQ RLLDGVAALE ADDIDTFGQL
MYGSHEGLSH WYEVSCPELD ILVDIAREQP GVLGARMMGG GFGGCTINLV REEALEDFTK
LITEQYKAQT GKDTYLHVCK IQDGTNVIS