Gene ECD_04034 details

Gene Information

Locus tag: ECD_04034
Symbol: yjeF
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 4300586
End bp: 4302133
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: predicted carbohydrate kinase
Protein accession: ACT45823
Protein GI: 253980153
COG category:
COG ID:
TIGRFAM ID:
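
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. Both are assumptions about this record's conventions, not something the record states, and the function name is illustrative.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values copied from the Gene Information fields.
gene_len, protein_len = expected_lengths(4300586, 4302133)
print(gene_len, protein_len)  # 1548 515
```

The result matches the listed gene length (1548 bp) and protein length (515 aa).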


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00166005
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGACC ATACAATGAA GAAAAACCCC GTAAGTATAC CACACACCGT CTGGTACGCC 
GACGATATCC GCCGCGGAGA ACGCGAGGCG GCAGATGCGC TGGGGCTCAC ACTCTATGAG
CTGATGCTTC GCGCTGGCGA AGCAGCATTC CAGGTGTGTC GTTCGGCTTA TCCTGACGCC
CGCCACTGGC TGGTGCTGTG CGGTCATGGT AATAACGGCG GCGATGGCTA TGTGGTCGCG
CGGCTGGCCA CAGCGGTCGG TATTGAGGTC ACGCTGCTGG CCCAGGAAAG TGACAAACCG
TTGCCGGAAG AGGCCGCGCT GGCACGCGAA GCATGGTTAA ACGCGGGTGG CGAGATCCAT
GCTTCGAATA TTGTCTGGCC CGAATCGGTA GATCTGATTG TTGATGCGCT GCTCGGTACC
GGCTTGCAGC AAGCGCCCCG CGAATCCATT AGCCAGTTAA TCGACCACGC TAATTCCCAT
CCTGCGCCGA TTGCGGCGGT TGATATCCCT TCCGGCCTGC TGGCTGAAAC CGGCGCTACG
CCAGGCGCAG TGATCAACGC CGATCACACC ATCACTTTTA TTGCGCTGAA ACCAGGCTTG
CTCACTGGAA AAGCGCGGGA TGTTACAGGA CAACTGCATT TTGACTCACT GGGGCTGGAT
AGCTGGCTGG CAGGTCAGGA GACGAAAATT CAGCGGTTTT CGGCAGAACA ACTTTCTCAC
TGGCTGAAAC CGCGTCGCCC GACTTCGCAT AAAGGCGATC ACGGGCGGCT GGTGATTATC
GGTGGCGATC ACGGCACTGC GGGGGCTATT CGTATGACGG GGGAAGCGGC GCTACGTGCT
GGTGCTGGTT TAGTCCGAGT ACTGACCCGC AGTGAAAACA TTGCGCCGCT GCTGACTGCA
CGACCAGAAT TGATGGTGCA TGAACTGACG ATGGACTCTC TTGCCGAAAG CCTGGAATGG
GCCGATGTGG TGGTGATTGG TCCCGGTCTG GGCCAGCAAG AGTGGGGGAA AAAAGCCCTG
CAAAAAGTTG AGAATTTTCG CAAACCGATG TTGTGGGATG CCGATGCATT GAACCTGCTG
GCAATCAATC CCGATAAGCG TCACAATCGC GTGATCACGC CGCATCCTGG CGAGGCCGCA
CGGTTGTTAG GCTGTTCCGT CGCTGAAATT GAAAGTGACC GCTTACATTG CGCCAAACGT
CTGGTACAAC GTTATGGCGG CGTAGCGGTG CTGAAAGGTG CCGGAACCGT GGTCGCCGCC
CATCCTGACG CTTTAGGCAT TATTGATGTC GGAAATGCAG GCATGGCGAG CGGCGGCATG
GGCGATGTGC TCTCTGGTAT TATTGGCGCA TTGCTTGGGC AAAAACTGTC GCCGTATGAT
GCAGCCTGTG CAGGCTGTGT TGCGCACGGT GCGGCAGCTG ACGTACTGGC GGCGCGTTTT
GGAACGCGCG GGATGCTGGC AACCGATCTC TTTTCCACGC TACAGCGTAT TGTTAACCCG
GAAGTGACTG ATAAAAACCA TGATGAATCG AGTAATCCCG CTCCCTGA
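
The GC content reported for this gene can be recomputed from the sequence. The sketch below is a minimal Python example, assuming the sequence is pasted in the same space-separated 10-base blocks used above. The helper names `parse_blocks` and `gc_fraction` are illustrative and not part of any database tool.

```python
def parse_blocks(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here as a short sample.
sample = parse_blocks(
    "ATGACGGACC ATACAATGAA GAAAAACCCC GTAAGTATAC CACACACCGT CTGGTACGCC"
)
print(len(sample), round(gc_fraction(sample), 2))
```

Passing the full 1548 bp sequence to `gc_fraction` gives a value that can be compared with the 57% listed in the record.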
 
Protein sequence
MTDHTMKKNP VSIPHTVWYA DDIRRGEREA ADALGLTLYE LMLRAGEAAF QVCRSAYPDA 
RHWLVLCGHG NNGGDGYVVA RLATAVGIEV TLLAQESDKP LPEEAALARE AWLNAGGEIH
ASNIVWPESV DLIVDALLGT GLQQAPRESI SQLIDHANSH PAPIAAVDIP SGLLAETGAT
PGAVINADHT ITFIALKPGL LTGKARDVTG QLHFDSLGLD SWLAGQETKI QRFSAEQLSH
WLKPRRPTSH KGDHGRLVII GGDHGTAGAI RMTGEAALRA GAGLVRVLTR SENIAPLLTA
RPELMVHELT MDSLAESLEW ADVVVIGPGL GQQEWGKKAL QKVENFRKPM LWDADALNLL
AINPDKRHNR VITPHPGEAA RLLGCSVAEI ESDRLHCAKR LVQRYGGVAV LKGAGTVVAA
HPDALGIIDV GNAGMASGGM GDVLSGIIGA LLGQKLSPYD AACAGCVAHG AAADVLAARF
GTRGMLATDL FSTLQRIVNP EVTDKNHDES SNPAP
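
The protein sequence follows from the gene sequence by translation with table 11. The Python sketch below is a minimal illustration, not the tool used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code (the tables differ in which codons may act as initiators), and it assumes the input is already in reading frame 0. The name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Last line of the gene sequence; it begins on a codon boundary
# because every preceding line holds 60 bases.
print(translate("GAAGTGACTG ATAAAAACCA TGATGAATCG AGTAATCCCG CTCCCTGA"))
# EVTDKNHDESSNPAP
```

The output matches the final 15 residues of the protein sequence above, and the first 30 bases of the gene translate to its first ten residues, MTDHTMKKNP.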