Gene Shewmr4_3408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3408 
Symbol 
ID4253974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4069966 
End bp4071111 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content53% 
IMG OID638120046 
Productgalactokinase 
Protein accessionYP_735531 
Protein GI113971738 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0618615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACC CTGCGCAGCG CGCCACTAAG TTATTTGTCC AAACCTTTGG CACTAAAGCC 
GATGATTTAT ACCAAGCCCC AGGTCGGGTT AATTTGATCG GTGAATATAC GGATTACAAC
GACGGCTTCG TATTGCCCGC CGCCATTAAT TTTCATACTG TGATTGCGGT TAAACGCCGA
GACGACAATA AGTTTCGCGC CGTTGCCGAC GCCTTTCCGG GGCAAATCAA GGAATGGAGC
TTCGGTAAAG ACACCGAAAT CAATCCTGAG GATGGTTGGG TTAATTATCT CAAAGGCTTG
ACCGTGGCCA TGGCCAACAC TGGGCTTATC GCCAAAGGGT TAGACTTAGC GGTTGTCGGC
GATGTGCCAT TAGCCGCGGG TCTGTCTTCC TCCGGCGCCT TAGTCGTCGC CTTTGGCACC
GCCATTAGCG ACAGCAGCCA ACTGCATTTA TCTCCTATGG CGGTTGCACA ACTCGCTCAG
CGCGGTGAAT ATCGATATGT CTCATCGGCT TGCAGCATTA TGGACCATAT GATCTGCGCC
ATGGGCGAAC CGGATCATGC CTTGCTCATC GATTGTCTGG ATCTGGATAG CGAGCCTATT
GCGATCCCTG AAAATCTCAG CCTTATCATT ATCGATGCCC ATATCGAAAA ACAACGTCTG
GCGGCAACGA ATCAACAGCG CCGTGAAGAA TGCGCACAGG CTGCCGAGCA TTTTGGTCTC
GATGCCCTGC GCCACCTCGA CCTGCGCCAG CTCGAAAGTG CTAAAGATCA ATTGGATGAC
ACCCTGTATC GCCGCGCCAA ACACGTAGTC ACCGAAAACA AACGCACTCA GAGTGCCGCT
CGGGCGCTAG AGCAAAATAA TCTATCTAAA TTCAGTTTGT TAATGGCACA GTCCCATCAA
TCTCTGCGGG ATGATTTTGA GGTGACACTG CCCGAATTTG ACACTTTGGT GGACATAGTC
GGCCAAGTGA TTGGAGAGCG TGGCGGCATT CGCATGACCG ACGGTTGTGT CGTCGCCTTA
GTGGATCACG AACTCACCGA TGCCGTGGTC TCGGCGGTCG AGCATGCATT TTATGAACAG
ACCGGAATCG ATGCCACTGT GTATCTCTGC TCCGCGAGTG CTGGCGCGGG GCGCATCGAC
ATCTAG
 
Protein sequence
MSNPAQRATK LFVQTFGTKA DDLYQAPGRV NLIGEYTDYN DGFVLPAAIN FHTVIAVKRR 
DDNKFRAVAD AFPGQIKEWS FGKDTEINPE DGWVNYLKGL TVAMANTGLI AKGLDLAVVG
DVPLAAGLSS SGALVVAFGT AISDSSQLHL SPMAVAQLAQ RGEYRYVSSA CSIMDHMICA
MGEPDHALLI DCLDLDSEPI AIPENLSLII IDAHIEKQRL AATNQQRREE CAQAAEHFGL
DALRHLDLRQ LESAKDQLDD TLYRRAKHVV TENKRTQSAA RALEQNNLSK FSLLMAQSHQ
SLRDDFEVTL PEFDTLVDIV GQVIGERGGI RMTDGCVVAL VDHELTDAVV SAVEHAFYEQ
TGIDATVYLC SASAGAGRID I