Gene EcDH1_1021 details


Gene Information

Locus tag: EcDH1_1021
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1089747
End bp: 1091027
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: 4-aminobutyrate aminotransferase
Protein accession: ACX38702
Protein GI: 260448280
COG category:
COG ID:
TIGRFAM ID:
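
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions agree with the figures in this record, but the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(1089747, 1091027)
print(length)                  # 1281
print(protein_length(length))  # 426
```

Both results match the reported gene length (1281 bp) and protein length (426 aa).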


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGCA ATAAAGAGTT AATGCAGCGC CGCAGTCAGG CGATTCCCCG TGGCGTTGGG 
CAAATTCACC CGATTTTCGC TGACCGCGCG GAAAACTGCC GGGTGTGGGA CGTTGAAGGC
CGTGAGTATC TTGATTTCGC GGGCGGGATT GCGGTGCTCA ATACCGGGCA CCTGCATCCG
AAGGTGGTGG CCGCGGTGGA AGCGCAGTTG AAAAAACTGT CGCACACCTG CTTCCAGGTG
CTGGCTTACG AGCCGTATCT GGAGCTGTGC GAGATTATGA ATCAGAAGGT GCCGGGCGAT
TTCGCCAAGA AAACGCTGCT GGTTACGACC GGTTCCGAAG CGGTGGAAAA CGCGGTAAAA
ATCGCCCGCG CCGCCACCAA ACGTAGCGGC ACCATCGCTT TTAGCGGCGC GTATCACGGG
CGCACGCATT ACACGCTGGC GCTGACCGGC AAGGTGAATC CGTACTCTGC GGGCATGGGG
CTGATGCCGG GTCATGTTTA TCGCGCGCTT TATCCTTGCC CGCTGCACGG CATAAGCGAG
GATGACGCTA TCGCCAGCAT CCACCGGATC TTCAAAAATG ATGCCGCGCC GGAAGATATC
GCCGCCATCG TGATTGAGCC GGTTCAGGGC GAAGGCGGTT TCTACGCCTC GTCGCCAGCC
TTTATGCAGC GTTTACGCGC TCTGTGTGAC GAGCACGGGA TCATGCTGAT TGCCGATGAA
GTGCAGAGCG GCGCGGGGCG TACCGGCACG CTGTTTGCGA TGGAGCAGAT GGGCGTTGCG
CCGGATCTTA CCACCTTTGC GAAATCGATC GCGGGCGGCT TCCCGCTGGC GGGCGTCACC
GGGCGCGCGG AAGTAATGGA TGCCGTCGCT CCAGGCGGTC TGGGCGGCAC CTATGCGGGT
AACCCGATTG CCTGCGTGGC TGCGCTGGAA GTGTTGAAGG TGTTTGAGCA GGAAAATCTG
CTGCAAAAAG CCAACGATCT GGGGCAGAAG TTGAAAGACG GATTGCTGGC GATAGCCGAA
AAACACCCGG AGATCGGCGA CGTACGCGGG CTGGGGGCGA TGATCGCCAT TGAGCTGTTT
GAAGACGGCG ATCACAACAA GCCGGACGCC AAACTCACCG CCGAGATCGT GGCTCGCGCC
CGCGATAAAG GCCTGATTCT TCTCTCCTGC GGCCCGTATT ACAACGTGCT GCGCATCCTT
GTACCGCTCA CCATTGAAGA CGCTCAGATC CGTCAGGGTC TGGAGATCAT CAGCCAGTGT
TTTGATGAGG CGAAGCAGTA G
 
Protein sequence
MNSNKELMQR RSQAIPRGVG QIHPIFADRA ENCRVWDVEG REYLDFAGGI AVLNTGHLHP 
KVVAAVEAQL KKLSHTCFQV LAYEPYLELC EIMNQKVPGD FAKKTLLVTT GSEAVENAVK
IARAATKRSG TIAFSGAYHG RTHYTLALTG KVNPYSAGMG LMPGHVYRAL YPCPLHGISE
DDAIASIHRI FKNDAAPEDI AAIVIEPVQG EGGFYASSPA FMQRLRALCD EHGIMLIADE
VQSGAGRTGT LFAMEQMGVA PDLTTFAKSI AGGFPLAGVT GRAEVMDAVA PGGLGGTYAG
NPIACVAALE VLKVFEQENL LQKANDLGQK LKDGLLAIAE KHPEIGDVRG LGAMIAIELF
EDGDHNKPDA KLTAEIVARA RDKGLILLSC GPYYNVLRIL VPLTIEDAQI RQGLEIISQC
FDEAKQ
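
The sequences above are printed in blocks of 10 characters separated by spaces and line breaks, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-percentage helper; the function names are hypothetical and not part of the source database, and no computed value for this gene is asserted here.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full "Gene sequence" block into a string, then clean it.
# Only the start of the block is shown here to keep the example short.
fragment = clean_sequence("ATGAACAGCA ATAAAGAGTT\nAATG")
print(fragment)       # ATGAACAGCAATAAAGAGTTAATG
print(len(fragment))  # 24
```

Applied to the complete gene sequence, `len(clean_sequence(...))` can be compared with the reported gene length and `gc_percent(...)` with the reported GC content.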