Gene EcolC_2905 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2905 
Symbol 
ID6065384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3165588 
End bp3166736 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content53% 
IMG OID641602310 
Productgalactokinase 
Protein accessionYP_001725859 
Protein GI170020905 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.233357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGA AAGAAAAAAC ACAATCTCTG TTTGCCAACG CATTTGGCTA CCCTGCCACT 
CACACCATTC AGGCGCCTGG CCGCGTGAAT TTGATTGGTG AACACACCGA CTACAACGAC
GGTTTCGTTC TGCCCTGCGC GATTGATTAT CAAACCGTGA TCAGTTGTGC ACCACGCGAT
GACCGTAAAG TTCGCGTGAT GGCAGCCGAT TATGAAAATC AGCTCGACGA GTTTTCCCTC
GATGCGCCCA TTGTCGCACA TGAAAACTAT CAATGGGCTA ACTACGTTCG TGGCGTGGTG
AAACATCTGC AACTGCGTAA CAACAGCTTC GGCGGCGTGG ACATGGTGAT CAGCGGCAAT
GTGCCGCAGG GTGCCGGGTT AAGTTCTTCC GCTTCACTGG AAGTCGCGGT CGGAACCGTA
TTGCAGCAGC TTTATCATCT GCCGCTGGAC GGCGCACAAA TCGCGCTTAA CGGTCAGGAA
GCAGAAAACC AGTTTGTAGG CTGTAACTGC GGGATCATGG ATCAGCTAAT TTCCGCGCTC
GGCAAGAAAG ATCATGCCTT GCTGATCGAT TGCCGCTCAC TGGGGACCAA AGCAGTTTCC
ATGCCCAAAG GTGTGGCTGT CGTCATCATC AACAGTAACT TCAAACGTAC CCTGGTTGGC
AGCGAATACA ACACCCGTCG TGAACAGTGC GAAACCGGTG CGCGTTTCTT CCAGCAGCCA
GCCCTGCGTG ATGTCACCAT TGAAGAGTTC AACGCTGTTG CGCATGAACT GGACCCGATC
GTGGCAAAAC GCGTGCGTCA TATACTGACT GAAAACGCCC GCACCGTTGA AGCTGCCAGC
GCGCTGGAGC AAGGCGACCT GAAACGTATG GGCGAGTTGA TGGCGGAGTC TCATGCCTCT
ATGCGCGATG ATTTCGAAAT CACCGTGCCG CAAATTGACA CTCTGGTAGA AATCGTCAAA
GCTGTGATTG GCGACAAAGG TGGCGTACGC ATGACCGGCG GCGGATTTGG CGGCTGTATC
GTCGCGCTGA TCCCGGAAGA GCTGGTGCCT GCCGTACAGC AAGCTGTCGC TGAACAATAT
GAAGCAAAAA CAGGTATTAA AGAGACTTTT TACGTTTGTA AACCATCACA AGGAGCAGGA
CAGTGCTGA
 
Protein sequence
MSLKEKTQSL FANAFGYPAT HTIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAPRD 
DRKVRVMAAD YENQLDEFSL DAPIVAHENY QWANYVRGVV KHLQLRNNSF GGVDMVISGN
VPQGAGLSSS ASLEVAVGTV LQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL
GKKDHALLID CRSLGTKAVS MPKGVAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP
ALRDVTIEEF NAVAHELDPI VAKRVRHILT ENARTVEAAS ALEQGDLKRM GELMAESHAS
MRDDFEITVP QIDTLVEIVK AVIGDKGGVR MTGGGFGGCI VALIPEELVP AVQQAVAEQY
EAKTGIKETF YVCKPSQGAG QC