Gene EcDH1_4180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4180 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4532198 
End bp4533583 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content53% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionACX41780 
Protein GI260451358 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000000115482 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGATA ACAAACCAGA GCTACAGCGT GGGCTGGAAG CTCGACATAT CGAACTCATC 
GCCCTGGGGG GCACCATTGG CGTCGGCCTG TTTATGGGGG CCGCCAGTAC CCTGAAATGG
GCCGGGCCAT CCGTATTGTT GGCCTATATC ATCGCCGGGC TGTTCGTCTT TTTCATCATG
CGTTCAATGG GCGAAATGTT GTTCCTCGAA CCGGTTACCG GTTCGTTCGC CGTTTATGCG
CATCGTTATA TGAGCCCGTT CTTTGGCTAT CTCACCGCCT GGTCTTACTG GTTTATGTGG
ATGGCGGTGG GGATCTCTGA AATCACCGCC ATTGGCGTTT ATGTCCAGTT CTGGTTCCCG
GAGATGGCGC AGTGGATACC CGCATTGATC GCAGTGGCGC TGGTGGCGTT GGCGAATCTG
GCGGCGGTGC GGTTGTACGG CGAAATCGAG TTCTGGTTCG CGATGATCAA AGTCACCACG
ATTATCGTGA TGATTGTCAT TGGCCTGGGC GTGATTTTCT TTGGCTTTGG CAATGGCGGG
CAGTCGATTG GTTTTAGCAA TCTCACAGAG CATGGCGGTT TCTTTGCGGG TGGCTGGAAA
GGGTTCCTGA CCGCTCTGTG TATTGTGGTG GCGTCCTACC AGGGCGTGGA GCTGATTGGC
ATTACTGCCG GTGAAGCGAA GAATCCGCAG GTGACGCTGC GCAGTGCCGT AGGCAAGGTG
CTGTGGCGGA TCCTGATTTT CTACGTAGGC GCGATTTTCG TTATCGTCAC CATCTTCCCG
TGGAATGAAA TAGGCAGCAA CGGCAGCCCG TTCGTACTGA CTTTTGCCAA AATCGGTATT
ACCGCAGCGG CGGGCATTAT CAACTTTGTG GTGCTGACGG CTGCGCTCTC TGGCTGTAAC
AGCGGCATGT ACAGTTGCGG ACGTATGCTC TACGCACTGG CGAAAAACCG TCAGTTACCG
GCGGCAATGG CGAAAGTTTC CCGTCACGGC GTACCGGTTG CGGGTGTGGC AGTATCTATT
GCTATTCTGC TAATTGGCTC ATGCCTGAAC TACATCATTC CCAATCCGCA GCGTGTGTTT
GTCTACGTCT ACAGTGCCAG CGTGCTTCCG GGGATGGTGC CATGGTTTGT GATATTGATA
AGCCAGCTGC GTTTTCGGCG TGCACATAAA GCGGCGATTG CCAGCCATCC GTTCCGCTCA
ATCCTGTTCC CGTGGGCCAA TTACGTAACA ATGGCATTCC TGATTTGCGT TTTGATCGGC
ATGTACTTTA ATGAAGATAC GCGTATGTCG CTGTTTGTTG GCATCATCTT TATGCTGGCG
GTGACGGCGA TTTATAAAGT TTTTGGCCTT AATCGCCACG GGAAAGCGCA TAAACTGGAG
GAATAA
 
Protein sequence
MADNKPELQR GLEARHIELI ALGGTIGVGL FMGAASTLKW AGPSVLLAYI IAGLFVFFIM 
RSMGEMLFLE PVTGSFAVYA HRYMSPFFGY LTAWSYWFMW MAVGISEITA IGVYVQFWFP
EMAQWIPALI AVALVALANL AAVRLYGEIE FWFAMIKVTT IIVMIVIGLG VIFFGFGNGG
QSIGFSNLTE HGGFFAGGWK GFLTALCIVV ASYQGVELIG ITAGEAKNPQ VTLRSAVGKV
LWRILIFYVG AIFVIVTIFP WNEIGSNGSP FVLTFAKIGI TAAAGIINFV VLTAALSGCN
SGMYSCGRML YALAKNRQLP AAMAKVSRHG VPVAGVAVSI AILLIGSCLN YIIPNPQRVF
VYVYSASVLP GMVPWFVILI SQLRFRRAHK AAIASHPFRS ILFPWANYVT MAFLICVLIG
MYFNEDTRMS LFVGIIFMLA VTAIYKVFGL NRHGKAHKLE E