Gene Sde_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0472 
Symbol 
ID3967990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp575670 
End bp577673 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content50% 
IMG OID637919535 
Producttransketolase 
Protein accessionYP_525948 
Protein GI90020121 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0841196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCAC GCAGAGAACT TGCGAATGCC ATCCGTGCAT TAAGTATGGA CGCCGTACAA 
AAAGCAAATA GTGGTCACCC AGGTGCACCA ATGGGAATGG CCGATATTGC GGAAGTGCTG
TGGAATGACT TCCTCAAGCA CAACCCAAGC AACCCTAATT GGGCCGATCG CGACCGTTTT
GTATTGTCAA ACGGCCACGG CTCTATGCTT ATTTATTCTT TGTTGCACCT TAGCGGTTAC
GACTTGCCAA TGAGCGAGTT GGCTCAGTTC CGTCAGTTGC ACTCTAAAAC CCCAGGTCAC
CCAGAGTTGG GTTATACCCC AGGTGTAGAA ACTACCACTG GCCCACTTGG CCAAGGTATC
AGTAACGCCG TAGGTATGGC GCTTGCCGAA AAAATGTTGG CTGCGCAGTT TAACCGCGAT
GGTCACGATA TTGTTGATCA CTACACCTAC TGCTTTTTAG GTGATGGCTG TTTGATGGAA
GGTGTTTCTC ACGAAACCTG TTCACTTGCT GGCACCTTGG GCTTGGGCAA ATTAATTGCC
TTTTGGGACG ACAACGGTAT TTCTATTGAT GGTCACGTAG AAGGCTGGTT TACCGATAAT
ACCCCAGCAC GTTTTGAAGC TTATGGCTGG CATGTTATCC CAGCGGTAGA TGGTCACGAC
CCAGAAGCTA TTAAAGCGGC GGTAGAAGCT GCGCAAGCAG AAACAGGCAA GCCAACGTTG
ATTTGTACTA AAACCACCAT TGGTTTTGGT TCACCTAACA AGAGCGGTTC TCACGATTGT
CACGGCGCAC CATTAGGCGA TGCAGAAATT GCCGCAGCGC GCGAATTTTT GGGCTGGCCT
CATGCGCCAT TCGAAATTCC AGACAACGTT TACGCTGGTT GGGATGCGAA AGAAAAAGGT
GCAGCGGCAC AGTCTGCATG GGAAGCCAAG TTTGAAGCAT ACAAGGCTGC ACAGCCCGCT
TTGGCGGCCG AGTTTGAGCG CCGCGTATTA AACGGTGACT TGCCTGCAGA TTTCGAAGCT
AAAGCCGATG CGTTTATTAA AGCTGTTAAC GAAAAAGGCG AAAGCATTGC CACCCGTAAA
GCGTCACAAA ACACTATTGC AGAATTTGGC GCTGCATTGC CAGAGCTACT CGGTGGCTCT
GCCGATTTGG CTGGCTCTAA CCTCACCATG TGGAGCGGTT CTAAGCCTGT TACGCGCGAA
GATGCCAGCG GCAACTATAT TTACTACGGT GTGCGTGAAT TTGGTATGAG CGCCATTATG
AATGGTATTG CTGCTCACGG TGGCTTTATT AACTACGGCG CAACCTTCTT AATGTTTATG
GAATACGCGC GCAACGCTGT GCGTATGTCT GCATTGATGA AGTTGCCAAA TATTTTTGTT
TACACCCACG ATTCCATTGG TCAGGGTGAA GACGGCCCAA CTCACCAGCC TATCGAGCAG
TTGGCCGCGT TGCGTTTAAC GCCAAACCTA AATACTTGGC GCCCAGCGGA TGCAGTTGAA
TCTGCGGTTG CTTGGAAATC TGCAGTAATG CGTAAAGACG GCCCAAGCGC ATTGGTATTT
ACCCGTCAAG GCGTTAAGGC GCAAAGCCAC GACGACGAGC AAATAGCCAA CATGGCGCGC
GGTGCATACG TACTAGTAGA TTGCGACGGC GAGCCAGAAG TGATGCTAAT TGCCACTGGT
TCTGAAGTGG GTATTACCGT AGATGCCGCA GCACAGCTAG CGGGTGAAGG CGTGAAGGTG
CGTGTTGTAT CCATGCCTTG TACCAATGTG TTCGATCAGC AAGATGCCGC CTACAAAGAA
TCTGTATTAC CTATTGCGGT TACCCATCGC GTAGCGGTAG AAACGTCACA CGTTGACTAC
TGGGCGAAAT ACGTAGGCAT TGATGGCCGC GTAGTGGGCA TGACCACCTT CGGTGAATCT
GCACCGGGTG GCGCATTGCT TGAGTACTTC GGTTTTACTG TAGAAAACGT CGTAAACACC
GTTAAAGAAT TGCTAGAAGA CTAA
 
Protein sequence
MSSRRELANA IRALSMDAVQ KANSGHPGAP MGMADIAEVL WNDFLKHNPS NPNWADRDRF 
VLSNGHGSML IYSLLHLSGY DLPMSELAQF RQLHSKTPGH PELGYTPGVE TTTGPLGQGI
SNAVGMALAE KMLAAQFNRD GHDIVDHYTY CFLGDGCLME GVSHETCSLA GTLGLGKLIA
FWDDNGISID GHVEGWFTDN TPARFEAYGW HVIPAVDGHD PEAIKAAVEA AQAETGKPTL
ICTKTTIGFG SPNKSGSHDC HGAPLGDAEI AAAREFLGWP HAPFEIPDNV YAGWDAKEKG
AAAQSAWEAK FEAYKAAQPA LAAEFERRVL NGDLPADFEA KADAFIKAVN EKGESIATRK
ASQNTIAEFG AALPELLGGS ADLAGSNLTM WSGSKPVTRE DASGNYIYYG VREFGMSAIM
NGIAAHGGFI NYGATFLMFM EYARNAVRMS ALMKLPNIFV YTHDSIGQGE DGPTHQPIEQ
LAALRLTPNL NTWRPADAVE SAVAWKSAVM RKDGPSALVF TRQGVKAQSH DDEQIANMAR
GAYVLVDCDG EPEVMLIATG SEVGITVDAA AQLAGEGVKV RVVSMPCTNV FDQQDAAYKE
SVLPIAVTHR VAVETSHVDY WAKYVGIDGR VVGMTTFGES APGGALLEYF GFTVENVVNT
VKELLED