Gene EcDH1_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2020 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2176349 
End bp2177521 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content51% 
IMG OID 
Productaminotransferase class I and II 
Protein accessionACX39677 
Protein GI260449255 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.07254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGATT TTTCAAAGGT CGTGGATCGT CATGGCACAT GGTGTACACA GTGGGATTAT 
GTCGCTGACC GTTTCGGCAC TGCTGACCTG TTACCGTTCA CGATTTCAGA CATGGATTTT
GCCACTGCCC CCTGCATTAT CGAGGCGCTG AATCAGCGCC TGATGCACGG CGTATTTGGC
TACAGCCGCT GGAAAAACGA TGAGTTTCTC GCGGCTATTG CCCACTGGTT TTCCACCCAG
CATTACACCG CCATCGATTC TCAGACGGTG GTGTATGGCC CTTCTGTCAT CTATATGGTT
TCAGAACTGA TTCGTCAGTG GTCTGAAACA GGTGAAGGCG TGGTGATCCA CACACCCGCC
TATGACGCAT TTTACAAGGC CATTGAAGGT AACCAGCGCA CAGTAATGCC CGTTGCTTTA
GAGAAGCAGG CTGATGGTTG GTTTTGCGAT ATGGGCAAGT TGGAAGCCGT GTTGGCGAAA
CCAGAATGTA AAATTATGCT CCTGTGTAGC CCACAGAATC CTACCGGGAA AGTGTGGACG
TGCGATGAGC TGGAGATCAT GGCTGACCTG TGCGAGCGTC ATGGTGTGCG GGTTATTTCC
GATGAAATCC ATATGGATAT GGTTTGGGGC GAGCAGCCGC ATATTCCCTG GAGTAATGTG
GCTCGCGGAG ACTGGGCGTT GCTAACGTCG GGCTCGAAAA GTTTCAATAT TCCCGCCCTG
ACCGGTGCTT ACGGGATTAT AGAAAATAGC AGTAGCCGCG ATGCCTATTT ATCGGCACTG
AAAGGCCGTG ATGGGCTTTC TTCCCCTTCG GTACTGGCGT TAACTGCCCA TATCGCCGCC
TATCAGCAAG GCGCGCCGTG GCTGGATGCC TTACGCATCT ATCTGAAAGA TAACCTGACG
TATATCGCAG ATAAAATGAA CGCCGCGTTT CCTGAACTCA ACTGGCAGAT CCCACAATCC
ACTTATCTGG CATGGCTTGA TTTACGTCCG TTGAATATTG ACGACAACGC GTTGCAAAAA
GCACTTATCG AACAAGAAAA AGTCGCGATC ATGCCGGGGT ATACCTACGG TGAAGAAGGT
CGTGGTTTTG TCCGTCTCAA TGCCGGCTGC CCACGTTCGA AACTGGAAAA AGGTGTGGCT
GGATTAATTA ACGCCATCCG CGCTGTTCGT TAA
 
Protein sequence
MFDFSKVVDR HGTWCTQWDY VADRFGTADL LPFTISDMDF ATAPCIIEAL NQRLMHGVFG 
YSRWKNDEFL AAIAHWFSTQ HYTAIDSQTV VYGPSVIYMV SELIRQWSET GEGVVIHTPA
YDAFYKAIEG NQRTVMPVAL EKQADGWFCD MGKLEAVLAK PECKIMLLCS PQNPTGKVWT
CDELEIMADL CERHGVRVIS DEIHMDMVWG EQPHIPWSNV ARGDWALLTS GSKSFNIPAL
TGAYGIIENS SSRDAYLSAL KGRDGLSSPS VLALTAHIAA YQQGAPWLDA LRIYLKDNLT
YIADKMNAAF PELNWQIPQS TYLAWLDLRP LNIDDNALQK ALIEQEKVAI MPGYTYGEEG
RGFVRLNAGC PRSKLEKGVA GLINAIRAVR