Gene Elen_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1201 
Symbol 
ID8415492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1440749 
End bp1441981 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content73% 
IMG OID645024164 
ProductGalactokinase 
Protein accessionYP_003181560 
Protein GI257790954 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.267073 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCC TGGAGGATGC CCGCACTCCT GCGCCGGCGC GCCTGCGCAA GCGGTTCGCC 
GAGCGCTTCG GGAACGCGGG CGCACGGCCG CTCGCATTCG CATCCGCTCC CGGGCGCGTG
GAGCTGGCCG GCAACCATAC CGACCATCAG GGCGGGCGCA CGATCTCGGC CGCCATCGAC
CGCCGCATCT ACGCGCTCGC CGCCCCGAAC GGCACCGACG AGATGCGCGT GAGCATGGAG
AGCTTCGGCG ACATCGCGCT GAGCTGCGGC GACTTGGACG CGCGCGAAAG CGAGCGCGGG
ACGTCGCTTG CGCTCGTGCG CGGCATGGCG GCCGCCTTCG TGCGCGCGGG CGGAAGGCTT
TCCGGGTTCG ATGCGGCCAC CTGCTCCGAC ATCCCCGCAG GCGCCGGGGT CTCGTCGTCG
GCCGCGTTCG AGATGCTGGT CGGCGTGCTG CTGCGCGTGC TGTGCGACCC GACGGGCGCC
GTGCCGTGCG ACCCCGTGGC GCTGGCGTTG GAGGGCGCTC AGGTCGAGCA GGCCTACTTC
GGGAAGCCCT GCGGCGTGCA GGACCAGCTG GCCAGCGCGC AGGGCGGCGC GGCGGCCTTC
GACTTCGCGG GCGACCTGCC GCGCGTCGAG CCCATCGCCT TCGACTGGGA GGCGTGCGGC
TATGCGCTCT GCCTGGTGGA CAGCCGATGC GACCACTCCG TCCACGCGGA CGAGTACGCG
GCCGTTCCCG CCGACATGCG CGCGGTCGCG CGGCGCTTCG GATGCGAGCG GCTGGAAGAC
GTTCCCTACC CCGTCTTCCT CGCCCGGCTG GCCGACGTGC GCGCGCACCT GGGCGACCGT
GCGGCCTTGC GCGCGCTCCA CTACTTCGAG GAGACGCGGC GCGTTGCCGC GCAGCAGCGG
GCGTTGGAAT CCGGAGACAT CGAAGGGTTT CTCGAAGGCG TGCGGCAATC GGGCGCGTCG
TCGGCGCAGT TCCTGCAGAA CGTGTCGCCG CGCGGCGACG GCTTGGGCGC ACGGCAGCCG
GCCATGATGG TCCTCGCGCT GTGCGCGCAC CTCCTGGACG GGCGCGGCGC GTACCGCATC
CACGGCGGCG GGTTCGGCGG CAGCGCGCTG GCCTTCGTGC CGGCGGAGGA CATCGACGCG
TTCTGCGAGT CGATGGATGC GCTGCTGGGC TACGACGCCT GCCTGCGCGC GAAGGTAGAC
GGCCGCGGCG CGTACGCGGA GCGGATGGCC TGA
 
Protein sequence
MTTLEDARTP APARLRKRFA ERFGNAGARP LAFASAPGRV ELAGNHTDHQ GGRTISAAID 
RRIYALAAPN GTDEMRVSME SFGDIALSCG DLDARESERG TSLALVRGMA AAFVRAGGRL
SGFDAATCSD IPAGAGVSSS AAFEMLVGVL LRVLCDPTGA VPCDPVALAL EGAQVEQAYF
GKPCGVQDQL ASAQGGAAAF DFAGDLPRVE PIAFDWEACG YALCLVDSRC DHSVHADEYA
AVPADMRAVA RRFGCERLED VPYPVFLARL ADVRAHLGDR AALRALHYFE ETRRVAAQQR
ALESGDIEGF LEGVRQSGAS SAQFLQNVSP RGDGLGARQP AMMVLALCAH LLDGRGAYRI
HGGGFGGSAL AFVPAEDIDA FCESMDALLG YDACLRAKVD GRGAYAERMA