Gene Ent638_1248 details


Gene Information

Locus tag: Ent638_1248
Symbol:
ID: 5114210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 1373777
End bp: 1374925
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 56%
IMG OID: 640491435
Product: galactokinase
Protein accession: YP_001175980
Protein GI: 146310906
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID: [TIGR00131] galactokinase
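
The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 1373777
end_bp = 1374925

# Inclusive coordinates: both endpoints belong to the gene.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; a CDS should leave no remainder.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1149 0 382
```

Under these assumptions the result agrees with the listed Gene Length (1149 bp) and Protein Length (382 aa).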


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.4259
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTCA AAGATAAAAC ACAATCCCTG TTTGCTGAAA CATTCGGCTA CCCTGCCACC 
CACGCAATTC AGGCGCCTGG CCGCGTGAAC CTGATTGGTG AGCACACCGA TTACAACGAC
GGTTTTGTGC TGCCATGTGC GATCGATTAT CAAACTGTTA TCAGCTGTGC AAAACGCGAT
GACCGTATCG TGCGCGTCAT TGCGGCAGAT TACGATAATC AAACCGACGA GTTTTCGCTC
GACGAGCCGA TCGTGGCACA CGATACGCAG CAGTGGTCTA ACTACGTACG TGGCGTGGTG
AAGCATCTGC AAATGCGTAA TAAGGGCTTT GGCGGCGCGG ACCTGGTGAT CGCCGGTAAC
GTGCCGCAGG GCGCGGGGTT AAGCTCTTCT GCGTCTCTTG AAGTGGCCGT TGGGACGGTC
TTCCAGCAGT TGTATCACCT GCCGCTGGAC GGCGCGCAAA TAGCCCTGAA TGGCCAGGAA
GCTGAGAACC AGTTCGTGGG CTGCAACTGC GGCATCATGG ACCAGCTGAT CTCTGCTCTT
GGTAAAAAAG AGCACGCGCT ACTGATCGAC TGCCGCTCGC TCGGCACCAA AGCGGTTCCC
CTGCCAAAAG GCGCGGCGGT GGTGATCATC AACAGTAATT TCAAACGCAC GCTGGTGGGC
AGCGAATACA ACACCCGCCG CGAGCAGTGC GAAACCGGGG CGCGTTTCTT CCAACAACCG
GCGCTGCGTG ATGTCTCTCT AAACGAGTTC AATAAAGTGG CTCACGAGCT GGATCCCGTT
GTGACCAAAC GCGTTCGCCA CGTGTTAACC GAAAATGCAC GCACCGTTGA AGCCGCGTCA
GCACTGGCGC AGGGCGATTT GAAACGGATG GGCGAACTGA TGGCTGAATC GCACGCGTCA
ATGCGCGACG ACTTCGAAAT CACTGTTCCG CAAATCGACA CGCTGGTGGA GATCGTCAAA
GCGACTATCG GCGACAAAGG CGGCGTACGC ATGACCGGTG GCGGCTTCGG CGGTTGTGTT
GTCGCCCTCA TCCCGGAAGA GTGGGTCCCT GCCGTTCAGG ACGCCGTTTC ACAGCAATAT
GAAGCGAAAA CCGGAATCAA AGAAACCTTC TACGTCTGCA AACCTTCACA AGGAGCGGGT
CAGTGCTAA
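
The GC content field in this record can be recomputed from the gene sequence. The helper below is a rough sketch, not the database's own method: `gc_fraction` is a hypothetical name, and it assumes the sequence is pasted with the spaces and line breaks shown above, which it ignores.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq."""
    # Keep only nucleotide letters; spaces and newlines from the listing are dropped.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Example: the first line of the gene sequence as listed above.
first_line = "ATGAGTCTCA AAGATAAAAC ACAATCCCTG TTTGCTGAAA CATTCGGCTA CCCTGCCACC"
print(round(gc_fraction(first_line), 3))
```

Passing the full 1149 bp sequence instead of one line gives a value that can be compared with the rounded GC content field of the record.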
 
Protein sequence
MSLKDKTQSL FAETFGYPAT HAIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAKRD 
DRIVRVIAAD YDNQTDEFSL DEPIVAHDTQ QWSNYVRGVV KHLQMRNKGF GGADLVIAGN
VPQGAGLSSS ASLEVAVGTV FQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL
GKKEHALLID CRSLGTKAVP LPKGAAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP
ALRDVSLNEF NKVAHELDPV VTKRVRHVLT ENARTVEAAS ALAQGDLKRM GELMAESHAS
MRDDFEITVP QIDTLVEIVK ATIGDKGGVR MTGGGFGGCV VALIPEEWVP AVQDAVSQQY
EAKTGIKETF YVCKPSQGAG QC
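
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; it assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the listing.
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTCTCA AAGATAAAAC ACAATCCCTG"))  # prints: MSLKDKTQSL
```

The output matches the first ten residues of the listed protein sequence. Translating the final nine bases, `CAGTGCTAA`, yields `QC` followed by a stop codon, which matches the end of the protein.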