Gene EcDH1_3053 details


Gene Information

Locus tag: EcDH1_3053
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3277784
End bp: 3279196
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: amino acid permease-associated region
Protein accession: ACX40681
Protein GI: 260450259
COG category:
COG ID:
TIGRFAM ID:
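
The start, end, and length values above agree with each other if the coordinates are read as inclusive positions and the protein length excludes the terminating stop codon. The record does not state either convention, so both are assumptions in the sketch below, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(3277784, 3279196)
print(length, protein_length_aa(length))  # 1413 470
```

Under these assumptions the computed values match the listed Gene Length (1413 bp) and Protein Length (470 aa).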


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCCCTCA ACAAAAAAGA CACACAGGGG AAAGGCGTGA AAAACGCGTC AACCGTATCG 
GAAGATACTG CGTCGAATCA AGAGCCGACG CTTCATCGCG GATTACATAA CCGTCATATT
CAACTGATTG CGTTGGGTGG CGCAATTGGT ACTGGTCTGT TTCTTGGCAT TGGCCCGGCG
ATTCAGATGG CGGGTCCGGC TGTATTGCTG GGCTACGGCG TCGCCGGGAT CATCGCTTTC
CTGATTATGC GCCAGCTTGG CGAAATGGTG GTTGAGGAGC CGGTATCCGG TTCATTTGCC
CACTTTGCCT ATAAATACTG GGGACCGTTT GCGGGCTTCC TCTCTGGCTG GAACTACTGG
GTAATGTTCG TGCTGGTGGG AATGGCAGAG CTGACCGCTG CGGGCATCTA TATGCAGTAC
TGGTTCCCGG ATGTTCCAAC GTGGATTTGG GCTGCCGCCT TCTTTATTAT CATCAACGCC
GTTAACCTGG TGAACGTGCG CTTATATGGC GAAACCGAGT TCTGGTTTGC GTTGATTAAA
GTGCTGGCAA TCATCGGTAT GATCGGCTTT GGCCTGTGGC TGCTGTTTTC TGGTCACGGC
GGCGAGAAAG CCAGTATCGA CAACCTCTGG CGCTACGGTG GTTTCTTCGC CACCGGCTGG
AATGGGCTGA TTTTGTCGCT GGCGGTAATT ATGTTCTCCT TCGGCGGTCT GGAGCTGATT
GGGATTACTG CCGCTGAAGC GCGCGATCCG GAAAAAAGCA TTCCAAAAGC GGTAAATCAG
GTGGTGTATC GCATCCTGCT GTTTTACATC GGTTCACTGG TGGTTTTACT GGCGCTCTAT
CCGTGGGTGG AAGTGAAATC CAACAGTAGC CCGTTTGTGA TGATTTTCCA TAATCTCGAC
AGCAACGTGG TAGCTTCTGC GCTGAACTTC GTCATTCTGG TAGCATCGCT GTCAGTGTAT
AACAGCGGGG TTTACTCTAA CAGCCGCATG CTGTTTGGCC TTTCTGTGCA GGGTAATGCG
CCGAAGTTTT TGACTCGCGT CAGCCGTCGC GGTGTGCCGA TTAACTCGCT GATGCTTTCC
GGAGCGATCA CTTCGCTGGT GGTGTTAATC AACTATCTGC TGCCGCAAAA AGCGTTTGGT
CTGCTGATGG CGCTGGTGGT AGCAACGCTG CTGTTGAACT GGATTATGAT CTGTCTGGCG
CATCTGCGTT TTCGTGCAGC GATGCGACGT CAGGGGCGTG AAACACAGTT TAAGGCGCTG
CTCTATCCGT TCGGCAACTA TCTCTGCATT GCCTTCCTCG GCATGATTTT GCTGCTGATG
TGCACGATGG ATGATATGCG CTTGTCAGCG ATCCTGCTGC CGGTGTGGAT TGTATTCCTG
TTTATGGCAT TTAAAACGCT GCGTCGGAAA TAA
 
Protein sequence
MPLNKKDTQG KGVKNASTVS EDTASNQEPT LHRGLHNRHI QLIALGGAIG TGLFLGIGPA 
IQMAGPAVLL GYGVAGIIAF LIMRQLGEMV VEEPVSGSFA HFAYKYWGPF AGFLSGWNYW
VMFVLVGMAE LTAAGIYMQY WFPDVPTWIW AAAFFIIINA VNLVNVRLYG ETEFWFALIK
VLAIIGMIGF GLWLLFSGHG GEKASIDNLW RYGGFFATGW NGLILSLAVI MFSFGGLELI
GITAAEARDP EKSIPKAVNQ VVYRILLFYI GSLVVLLALY PWVEVKSNSS PFVMIFHNLD
SNVVASALNF VILVASLSVY NSGVYSNSRM LFGLSVQGNA PKFLTRVSRR GVPINSLMLS
GAITSLVVLI NYLLPQKAFG LLMALVVATL LLNWIMICLA HLRFRAAMRR QGRETQFKAL
LYPFGNYLCI AFLGMILLLM CTMDDMRLSA ILLPVWIVFL FMAFKTLRRK
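
The protein sequence can be related to the gene sequence by reading it codon by codon with translation table 11, the table listed under Gene Information. The sketch below is a hand-rolled illustration, not the pipeline that produced this record. It assumes the first codon is read as methionine when it is one of a few initiators that table 11 permits (only a subset is listed here), and `translate_cds` is a hypothetical helper name. Only the first 30 bases of the gene sequence are used.

```python
# Codon-to-residue assignments in TCAG order; table 11 shares these
# assignments with the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the initiator codons allowed by table 11.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in ALT_STARTS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "GTGCCCCTCA ACAAAAAAGA CACACAGGGG"
print(translate_cds(first_30))  # MPLNKKDTQG
```

The result matches the first ten residues of the protein sequence above, which begins with M even though the gene sequence begins with GTG.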