Gene EcDH1_3490 details


Gene Information

Locus tag: EcDH1_3490
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3749829
End bp: 3751199
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: amino acid permease-associated region
Protein accession: ACX41105
Protein GI: 260450683
COG category:
COG ID:
TIGRFAM ID:
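
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes a single terminal stop codon; the function names are illustrative and are not part of the record.

```python
# Values copied from the record; everything else here is an illustrative assumption.
START_BP = 3749829
END_BP = 3751199
GENE_LENGTH_BP = 1371
PROTEIN_LENGTH_AA = 456


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # coordinates vs. gene length
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # gene length vs. protein length
```

Under those assumptions, 3751199 - 3749829 + 1 gives 1371 bp, and 1371 / 3 - 1 gives 456 aa, matching the listed values.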


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.475795
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGGTC AACAGCACGG CGAGCAGCTA AAGCGCGGCC TTAAAAACCG CCATATTCAG 
CTTATCGCGC TGGGTGGCGC GATAGGGACC GGGTTATTCC TGGGTAGCGC CTCCGTAATA
CAGTCCGCAG GGCCAGGGAT TATCCTGGGT TACGCCATTG CTGGTTTTAT CGCCTTTCTG
ATCATGCGTC AGCTGGGTGA AATGGTGGTC GAAGAACCTG TCGCAGGCTC CTTTAGCCAC
TTTGCTTATA AATACTGGGG CAGTTTTGCC GGTTTCGCCT CTGGCTGGAA CTACTGGGTA
CTGTACGTTT TAGTTGCCAT GGCTGAGCTG ACTGCCGTGG GTAAATACAT TCAGTTCTGG
TATCCGGAAA TCCCCACCTG GGTTTCTGCC GCCGTATTCT TTGTGGTGAT TAACGCCATC
AACCTGACCA ACGTTAAAGT GTTTGGCGAG ATGGAGTTCT GGTTTGCCAT TATCAAAGTT
ATCGCGGTGG TAGCGATGAT CATCTTCGGC GGCTGGCTGC TATTCAGTGG CAACGGCGGC
CCGCAGGCGA CCGTTAGCAA CCTGTGGGAT CAGGGTGGTT TCCTGCCGCA CGGCTTCACC
GGGCTGGTGA TGATGATGGC GATTATCATG TTCTCGTTCG GTGGTCTGGA ACTGGTGGGG
ATCACCGCAG CAGAAGCTGA TAACCCGGAG CAAAGTATAC CGAAAGCAAC TAACCAGGTT
ATCTACCGCA TCCTGATTTT CTATATTGGT TCGTTAGCCG TTCTGCTCTC ACTGATGCCG
TGGACCCGCG TTACCGCCGA TACCAGTCCG TTTGTGCTGA TCTTCCACGA GTTAGGCGAT
ACCTTTGTGG CGAATGCGCT GAACATCGTG GTACTGACTG CGGCGCTCTC CGTGTACAAC
AGCTGCGTAT ATTGCAACAG CCGTATGCTG TTTGGTCTGG CACAACAGGG TAATGCGCCA
AAAGCGCTGG CGTCTGTCGA TAAACGTGGT GTACCAGTAA ATACCATTCT GGTGTCTGCA
CTGGTAACGG CGTTGTGCGT ACTGATTAAC TACCTTGCCC CAGAGTCCGC TTTCGGACTG
TTAATGGCGC TGGTGGTATC TGCACTGGTA ATCAACTGGG CGATGATTAG CCTGGCGCAT
ATGAAATTCC GTCGCGCCAA GCAGGAACAA GGCGTGGTAA CTCGCTTCCC TGCTCTGCTT
TATCCGCTGG GTAACTGGAT CTGCCTGCTG TTTATGGCGG CGGTACTGGT GATTATGCTG
ATGACCCCAG GAATGGCGAT TTCGGTATAC CTGATCCCGG TATGGCTGAT CGTGTTAGGT
ATCGGCTATC TGTTTAAAGA GAAAACCGCC AAAGCCGTAA AAGCGCATTA A
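
The GC content field reported for this gene could be recomputed from the sequence block above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases among all bases after whitespace is removed; the helper name gc_percent is hypothetical, and the record does not say how its figure was calculated or rounded.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks used in the record's
    10-base grouping) is ignored; case is normalized.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence, pasted as it appears in the record.
    first_line = "ATGGAAGGTC AACAGCACGG CGAGCAGCTA AAGCGCGGCC TTAAAAACCG CCATATTCAG"
    print(f"{gc_percent(first_line):.1f}%")
```

Pasting the full gene sequence into the call in place of first_line would give a value to compare against the listed figure.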
 
Protein sequence
MEGQQHGEQL KRGLKNRHIQ LIALGGAIGT GLFLGSASVI QSAGPGIILG YAIAGFIAFL 
IMRQLGEMVV EEPVAGSFSH FAYKYWGSFA GFASGWNYWV LYVLVAMAEL TAVGKYIQFW
YPEIPTWVSA AVFFVVINAI NLTNVKVFGE MEFWFAIIKV IAVVAMIIFG GWLLFSGNGG
PQATVSNLWD QGGFLPHGFT GLVMMMAIIM FSFGGLELVG ITAAEADNPE QSIPKATNQV
IYRILIFYIG SLAVLLSLMP WTRVTADTSP FVLIFHELGD TFVANALNIV VLTAALSVYN
SCVYCNSRML FGLAQQGNAP KALASVDKRG VPVNTILVSA LVTALCVLIN YLAPESAFGL
LMALVVSALV INWAMISLAH MKFRRAKQEQ GVVTRFPALL YPLGNWICLL FMAAVLVIML
MTPGMAISVY LIPVWLIVLG IGYLFKEKTA KAVKAH
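
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG). The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored so that sequence lines can be pasted
    exactly as they are grouped in the record.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    first_line = "ATGGAAGGTC AACAGCACGG CGAGCAGCTA AAGCGCGGCC TTAAAAACCG CCATATTCAG"
    print(translate(first_line))  # MEGQQHGEQLKRGLKNRHIQ
```

The first 60 bases of the gene translate to the first 20 residues of the listed protein (MEGQQHGEQL KRGLKNRHIQ), and the final gene line ends in the stop codon TAA after the closing residues KAVKAH.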