Gene EcDH1_2344 details


Gene Information

Locus tag: EcDH1_2344
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2512862
End bp: 2514127
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: 4-aminobutyrate aminotransferase
Protein accession: ACX39987
Protein GI: 260449565
COG category:
COG ID:
TIGRFAM ID:
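
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that adds no residue to the protein. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2512862
end_bp = 2514127

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS is read in whole codons; the last codon is assumed
# to be the stop codon and contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1266 0 421
```

Under these assumptions the result agrees with the listed gene length (1266 bp) and protein length (421 aa).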


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.00720647
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAACA ATGAATTCCA TCAGCGTCGT CTTTCTGCCA CTCCGCGCGG GGTTGGCGTG 
ATGTGTAACT TCTTCGCCCA GTCGGCTGAA AACGCCACGC TGAAGGATGT TGAGGGCAAC
GAGTACATCG ATTTCGCCGC AGGCATTGCG GTGCTGAATA CCGGACATCG CCACCCTGAT
CTGGTCGCGG CGGTGGAGCA GCAACTGCAA CAGTTTACCC ACACCGCGTA TCAGATTGTG
CCGTATGAAA GCTACGTCAC CCTGGCGGAG AAAATCAACG CCCTTGCCCC GGTGAGCGGG
CAGGCCAAAA CCGCGTTCTT CACCACCGGT GCGGAAGCGG TGGAAAACGC GGTGAAAATT
GCTCGCGCCC ATACCGGACG CCCTGGCGTG ATTGCGTTTA GCGGCGGCTT TCACGGTCGT
ACGTATATGA CCATGGCGCT GACCGGAAAA GTTGCGCCGT ACAAAATCGG CTTCGGCCCG
TTCCCTGGTT CGGTGTATCA CGTACCTTAT CCGTCAGATT TACACGGCAT TTCAACACAG
GACTCCCTCG ACGCCATCGA ACGCTTGTTT AAATCAGACA TCGAAGCGAA GCAGGTGGCG
GCGATTATTT TCGAACCGGT GCAGGGCGAG GGCGGTTTCA ACGTTGCGCC AAAAGAGCTG
GTTGCCGCTA TTCGCCGCCT GTGCGACGAG CACGGTATTG TGATGATTGC TGATGAAGTG
CAAAGCGGCT TTGCGCGTAC CGGTAAGCTG TTTGCCATGG ATCATTACGC CGATAAGCCG
GATTTAATGA CGATGGCGAA AAGCCTCGCG GGCGGGATGC CGCTTTCGGG CGTGGTCGGT
AACGCGAATA TTATGGACGC ACCCGCGCCG GGCGGGCTTG GCGGCACCTA CGCCGGTAAC
CCGCTGGCGG TGGCTGCCGC GCACGCGGTG CTCAACATTA TCGACAAAGA ATCACTCTGC
GAACGCGCGA ATCAACTGGG CCAGCGTCTC AAAAACACGT TGATTGATGC CAAAGAAAGC
GTTCCGGCCA TTGCTGCGGT ACGCGGCCTG GGGTCGATGA TTGCGGTAGA GTTTAACGAT
CCGCAAACGG GCGAGCCGTC AGCGGCGATT GCACAGAAAA TCCAGCAACG CGCGCTGGCG
CAGGGGCTGC TCCTGCTGAC CTGTGGCGCA TACGGCAACG TGATTCGCTT CCTGTATCCG
CTGACCATCC CGGATGCGCA ATTCGATGCG GCAATGAAAA TTTTGCAGGA TGCGCTGAGC
GATTAA
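
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how a GC fraction could be computed from such a display. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the method the database used to obtain its GC content figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove display spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First displayed line of the gene sequence: six blocks of 10 bases.
first_line = "ATGAGCAACA ATGAATTCCA TCAGCGTCGT CTTTCTGCCA CTCCGCGCGG GGTTGGCGTG"
bases = clean_sequence(first_line)
print(len(bases))  # prints: 60
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field.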
 
Protein sequence
MSNNEFHQRR LSATPRGVGV MCNFFAQSAE NATLKDVEGN EYIDFAAGIA VLNTGHRHPD 
LVAAVEQQLQ QFTHTAYQIV PYESYVTLAE KINALAPVSG QAKTAFFTTG AEAVENAVKI
ARAHTGRPGV IAFSGGFHGR TYMTMALTGK VAPYKIGFGP FPGSVYHVPY PSDLHGISTQ
DSLDAIERLF KSDIEAKQVA AIIFEPVQGE GGFNVAPKEL VAAIRRLCDE HGIVMIADEV
QSGFARTGKL FAMDHYADKP DLMTMAKSLA GGMPLSGVVG NANIMDAPAP GGLGGTYAGN
PLAVAAAHAV LNIIDKESLC ERANQLGQRL KNTLIDAKES VPAIAAVRGL GSMIAVEFND
PQTGEPSAAI AQKIQQRALA QGLLLLTCGA YGNVIRFLYP LTIPDAQFDA AMKILQDALS
D
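
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon-to-amino-acid map of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG). The `translate` function is a hypothetical helper, not a tool provided by the database, and it assumes the input contains only A, C, G and T.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop display spaces and newlines
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three displayed blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGAGCAACA ATGAATTCCA TCAGCGTCGT"))  # prints: MSNNEFHQRR
```

The output matches the first 10 residues of the protein sequence. The final six bases of the gene, GATTAA, translate to D followed by a stop codon, which is consistent with the protein ending in D.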