Gene Ssed_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_4020 
Symbol 
ID5612508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp4924342 
End bp4925508 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content53% 
IMG OID640934975 
Productgalactokinase 
Protein accessionYP_001475752 
Protein GI157377152 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000198069 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCGAACC CCGCACAGCG CGCGAACAAA TTATTTGTGC AAACCTTTGG CACTACGGCT 
GATGACCTTT ACCTGGCACC GGGCCGGGTA AACTTGATCG GCGAACACAC CGACTATAAC
GATGGTTTCG TTCTGCCGGC AGCTATCAAC TTCCATACCA TTGTTGCGGT AAAACGCCGT
GAGGATGATA TCTACCGCGC CGTTTCCGAT GCCTTTCCCG GTGAGATAAA AGAGTGGGCT
TTCGGCCAGG AAGGTGTCAT GACATCCGAA GATGGCTGGG TCAATTATCT CAAAGGGTTC
ACAGCGGCAA TGGCGGCATC TGGCTTACCC GCAAAAGGGT TAGATATTGC AGTCGTCGGT
AACGTTCCGC TTGGCGCCGG CCTCTCATCC TCTGCGGCTC TGGAACTCGC GTTCGGTACT
GCCGTTAATG ATTGCAGCCA GCTCAGGCTC TCTCCTCTTG CCGTCGCACA AATGGCGCAG
CGCGGTGAGA ACCAATATGT GGGATGTGCC TGCGGGATCA TGGATCAGAT GATTAGCGCC
TTAGGTGAAC AGGATCATGC CCTACTCATC GATTGTGAGG ACTTAGACAG CGAACCGGTT
CATATACCCG ATAGCCTGAG CCTCATCATA GTCAATTCCA ATGTTCAGCG TGGATTAGTC
GATTCCGAGT ACAACCTGCG CCGCGAACAA TGCGAAGAGG TGGCAAGCCA TTTTGGACTT
GATTCCCTGA GACACCTTGA ACTTTCTCAG CTTGAAGCCG CAAAATCTGA GCTGTCGGAT
GTCTGTTATC GACGTGCCCG ACATGTGCTT ACCGAGAACA GACGAACTCA GAATGCCAGT
CACGCTCTGG AAGCCGGGAA TATCACCACC TTGAGCGAGC TCATGGCTCA ATCCCATGCA
TCTATGCGAG ATGACTTTGA GATCACGGTG CCTCAAATCG ATACTCTGGT AGAGATAATC
TCTGATGTTA TCGGCACCCG TGGCGGTGTA AGAATGACTG GCGGCGGCTT TGGTGGCTGT
GTGGTTGCTC TCGTTGACCA TGATCTGACC GATGCCGTAG TCGAAGCCAT TGAAGCTGAG
TACCCTAAAC AGACAGGACT GGAACCCACT GTCTACCTCT GCTCTGCCAG CGATGGGGCC
ACCCGAATCG AGAAAGATGT CTATTAG
 
Protein sequence
MSNPAQRANK LFVQTFGTTA DDLYLAPGRV NLIGEHTDYN DGFVLPAAIN FHTIVAVKRR 
EDDIYRAVSD AFPGEIKEWA FGQEGVMTSE DGWVNYLKGF TAAMAASGLP AKGLDIAVVG
NVPLGAGLSS SAALELAFGT AVNDCSQLRL SPLAVAQMAQ RGENQYVGCA CGIMDQMISA
LGEQDHALLI DCEDLDSEPV HIPDSLSLII VNSNVQRGLV DSEYNLRREQ CEEVASHFGL
DSLRHLELSQ LEAAKSELSD VCYRRARHVL TENRRTQNAS HALEAGNITT LSELMAQSHA
SMRDDFEITV PQIDTLVEII SDVIGTRGGV RMTGGGFGGC VVALVDHDLT DAVVEAIEAE
YPKQTGLEPT VYLCSASDGA TRIEKDVY