Gene Slin_2594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2594 
Symbol 
ID8726339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3134954 
End bp3136999 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content53% 
IMG OID 
Producttransketolase 
Protein accessionYP_003387411 
Protein GI284037481 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.123442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACTC AATCGGTCAA CCTCGACCAA CTCAGTATCA ATACCATCCG GCTTTTGTCG 
GTTGATGCGG TTCAGAAAGC CAATTCCGGA CACCCTGGCT TGCCGCTTGG CGCGGCTCCC
ATGGCGTATG TTCTCTGGTC GCGCTTTTTG CGGTTTAATC CACAAGACCC GCACTGGCCG
GATCGGGATC GGTTTGTGCT CTCTGCCGGG CACGGTTCGG CTTTGTTGTA CAGCTTATTG
CATTTGTATG GGTACGATCT GTCGCTTGAT GATATAAAAG GTTTCCGGCA AATTCATTCC
CGCACACCGG GTCACCCCGA GTCGAACCTA ACACCGGGAG TTGAAGTAAC CACTGGCCCG
CTTGGACAAG GGTTTGCCAA CGGGGTAGGC ATGGCCATGG CCGAAGCATT TTTGGCCGCA
GCCTACAATC GGGAAGGACA CACAGTCATG GACCATTATA CCTACTCCAT TGTGAGTGAT
GGCGATTTAA TGGAAGGGAT TGCGTCTGAA GCGGCTTCGC TGGCGGGCCA CCTTAAGTTG
GGGAAGCTGA TTTATTTGTA CGACGATAAC CTCATTTCGC TGGATGGGCC TACTAATCTA
GCGTTTACGG AAGACCGAAT GGCGCGTTTC GATGCGTATG GCTGGCATAC GCAGCATGTG
GCCGATGGCA ACGATCTGGA CGCTATTGAA GCGGCCATTC GCGCAGCCCA GGCCGAGACG
GATCGCCCGT CTATCATTGC CGTCCGTACG GTTATCGGCT TTGGCAGCCC AATGGAAGGA
ACCAGCAAAG TACACGGTAG CCCGCTGGGC GATGAAAATC TACGGAAAAC CAAAGCGTTT
TATGGTTTCG ACCCAGACCA GTCATTTGTC ATTCCGGATG AAGTAAAACC TCATTTGTTG
GAAGCAGGCA AGCGGGGTGC CGAGCTTCAG GCCGACTGGC AAAAACGGTT TGAGGCTTAC
AGAAACCAGT TCTCGGATCA GGCAGAGCTA TTTGACGTGT CATTTGCGGG TAAGTTCCCC
GACGATTGGG AAACCGATCT GCCCAAGTTT GCACCTGCTG ATGGCCCACT GGCCACCCGG
CAGGCCTCTG GCAAAGCCCT GGAAGCCCTG AAAAAACGAG TACCTTATCT CTTTGGTGGT
TCCGCCGATC TGGCTTCATC CAATGAGATG CCAACGAAAG GCGACATTAG TTTTCAGCCC
GGCCATTACG GAAACTCCAA CATCTGGTTT GGGGTACGTG AGCATGCCAT GGGAGCAGCC
CTGAACGGAA TGGCCCAGCA CGGTGGCGTG CACCCGTACG GCGGCACATT CCTCAACTTT
TCCGATTACA TGCGGGGAGC CATCCGGCTA ACGGCGTTGG CGGAATCGTC GGCGACGTTT
GTATTTACGC ACGACAGCAT TGGCCTGGGT GAAGACGGAC CCACACACCA ACCCGTTGAA
CAGGTCGTTT CGCTACGAAC CATACCAAAC ATTATTGTTT TGCGGCCGGC CGATGCCAAC
GAAACCGTTG AAGCCTGGCG AGTGGCCCTG CAACAGCCAA AGACACCCGT AGTACTAATA
CTCTCCCGGC AGAAACTGCC CGTGCTGGAT CAGGAAAAAT ACGGCTCGGC ACGTGGCCTG
GAGAAGGGAG CTTATATTTT AAGCGAAGCC GATGGTACGC CCGAGCTCAT ATTGATTGCC
ACAGGTTCTG AAGTGTCGTT GGTGCTGGAA GCGCAAGAGG AGCTAAAGAA ACAGGGCATT
CAGGCGCGGG TTGTTAGCAT GCCTTCATGG GAGTTGTTCG AAAAGCAAGA TCAGGCCTAT
CACCACGAAG TATTGCCGCC CTCGATTCGG AAGCGGCTTG CCGTAGAAAT GGGCTCGCCA
ATTGGCTGGC ATAAATACGT GACAGATGAA GGAACAACGA TTAGTATGAA CCGATTTGGC
TTGTCCGGCC CCGCCGAAGA AGTAATGGCT TACTTTGGCT TTACGGTGGA AAATGTAGTA
AACACGGCTA AATCGGTACT GGACGGCAAT CCTGACGGTA TTGAGAAAAA AGAAGTATTG
TCCTGA
 
Protein sequence
MTTQSVNLDQ LSINTIRLLS VDAVQKANSG HPGLPLGAAP MAYVLWSRFL RFNPQDPHWP 
DRDRFVLSAG HGSALLYSLL HLYGYDLSLD DIKGFRQIHS RTPGHPESNL TPGVEVTTGP
LGQGFANGVG MAMAEAFLAA AYNREGHTVM DHYTYSIVSD GDLMEGIASE AASLAGHLKL
GKLIYLYDDN LISLDGPTNL AFTEDRMARF DAYGWHTQHV ADGNDLDAIE AAIRAAQAET
DRPSIIAVRT VIGFGSPMEG TSKVHGSPLG DENLRKTKAF YGFDPDQSFV IPDEVKPHLL
EAGKRGAELQ ADWQKRFEAY RNQFSDQAEL FDVSFAGKFP DDWETDLPKF APADGPLATR
QASGKALEAL KKRVPYLFGG SADLASSNEM PTKGDISFQP GHYGNSNIWF GVREHAMGAA
LNGMAQHGGV HPYGGTFLNF SDYMRGAIRL TALAESSATF VFTHDSIGLG EDGPTHQPVE
QVVSLRTIPN IIVLRPADAN ETVEAWRVAL QQPKTPVVLI LSRQKLPVLD QEKYGSARGL
EKGAYILSEA DGTPELILIA TGSEVSLVLE AQEELKKQGI QARVVSMPSW ELFEKQDQAY
HHEVLPPSIR KRLAVEMGSP IGWHKYVTDE GTTISMNRFG LSGPAEEVMA YFGFTVENVV
NTAKSVLDGN PDGIEKKEVL S