Gene Sde_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1097 
Symbol 
ID3968261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1407158 
End bp1408315 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content48% 
IMG OID637920165 
Productgalactokinase 
Protein accessionYP_526571 
Protein GI90020744 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0010459 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.419761 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA TAATTCAAGA CGTTGCCGCT CTATTTGAAA CGCATTTTAA CGCTCTCCAC 
GAAGTACTCT TCCACGCGCC TGGGCGCGTA AACTTAATTG GGGAACACAC AGACTACAAC
AACGGCTTTG TGCTGCCATG CGCAATCGAC AGGGGCACGT ACCTTGCGAT TAAAACACGC
GAAGACAACT TGATTCGCGT TGTTGCAGGC AACTTAAGCA ATGCCGCGAG CGAATGGCCA
GCATCATTGC CAGTTGAGCA CGACAAAAAT AACGCATGGG CCGATTATAT TCGCGGCGTA
ACAGAGCAAC TACTGAAACA AGGTCACACA CTAAAAGGTA TGGACATTGC CGTACTGGGG
AATGTCCCTC AGGGAGCAGG CCTTAGTTCG TCTGCCTCTT TTTCTGTGGG ATTCGCCACG
GCGTGCAACG CTATTAATAC ACTGGGACTT TCGCCAACTG AAGTAGCGCT ATGCTGCCAA
GCGGCCGAGA ACGAATTTGC AGGATGCAAT TGCGGCATTA TGGATCAACT TATTTCCGCT
GCCGGCGAAG CAGGACACGC GTTGCTAATA AATTGCGGCG ATTACAGCTA TGAGCCTTAC
GCAATTCCTG AAGACCTAGC TATTATGATC ATAGACAGCA AAGTTAAGCG CGGTTTAGTA
GATAGCGAAT ACAACACTCG ACGCAAACAA TGTGAAGAAG CCGCATTGAT TATGGGCGTA
AGTAGTTTGC GCGATGCCAC CCTTTCTTTG CTAGCGGAAA GCAAAAACAA AATGACGGAC
GAGGTTTTTC GCCGCGCGAA ACATGTAATA ACAGAAAATC AACGCACCAT TGACGCAGCG
GAAGCGCTAG CCAACAAGAA CTACACATTG TTAAATAAAC TAATGGCCGA ATCACATATA
TCCATGCGCG ATGACTTTGA GGTAACCACC TCGCAAATAG ACTTACTTGT TGACTTAGTT
GGCGAGCACT TGGATAACGA CGGCGGTGTG AGGATGACCG GCGGAGGGTT TGGTGGGTGT
GTGGTGGCTT TGGTGCCCAA AGTAAAAGCA GAAGCAATCT CCAACGCAAT ACTTAAACCA
TATAAAGAAG CGACAAATTT AGACGCAGAG ACCCATATTT GTTTAGCGTC TGCGGGAGCA
GCTAGCCTAA ACACCTAA
 
Protein sequence
MKTIIQDVAA LFETHFNALH EVLFHAPGRV NLIGEHTDYN NGFVLPCAID RGTYLAIKTR 
EDNLIRVVAG NLSNAASEWP ASLPVEHDKN NAWADYIRGV TEQLLKQGHT LKGMDIAVLG
NVPQGAGLSS SASFSVGFAT ACNAINTLGL SPTEVALCCQ AAENEFAGCN CGIMDQLISA
AGEAGHALLI NCGDYSYEPY AIPEDLAIMI IDSKVKRGLV DSEYNTRRKQ CEEAALIMGV
SSLRDATLSL LAESKNKMTD EVFRRAKHVI TENQRTIDAA EALANKNYTL LNKLMAESHI
SMRDDFEVTT SQIDLLVDLV GEHLDNDGGV RMTGGGFGGC VVALVPKVKA EAISNAILKP
YKEATNLDAE THICLASAGA ASLNT