Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3851 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4144169 |
End bp | 4145425 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | amino acid permease-associated region |
Protein accession | ACX41453 |
Protein GI | 260451031 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGAC TCAAACAAGA ACTGGGGCTG GCCCAGGGCA TTGGCCTGCT ATCGACGTCA TTATTAGGCA CTGGCGTGTT TGCCGTTCCT GCGTTAGCTG CGCTGGTAGC GGGCAATAAC AGCCTGTGGG CGTGGCCCGT TTTGATTATC TTAGTGTTCC CGATTGCGAT TGTGTTTGCG ATTCTGGGTC GCCACTATCC CAGCGCAGGC GGCGTCGCGC ACTTCGTCGG TATGGCGTTT GGTTCGCGGC TTGAGCGAGT CACCGGCTGG CTGTTTTTAT CGGTCATTCC CGTGGGTTTG CCTGCCGCAC TACAAATTGC CGCCGGGTTC GGCCAGGCGA TGTTTGGCTG GCATAGCTGG CAACTGTTGT TGGCAGAACT CGGTACGCTG GCGCTGGTGT GGTATATCGG TACTCGCGGT GCCAGTTCCA GTGCTAATCT ACAAACCGTT ATTGCCGGAC TTATCGTCGC GCTGATTGTC GCTATCTGGT GGGCGGGCGA TATCAAACCT GCGAATATCC CCTTTCCGGC ACCTGGTAAT ATCGAACTTA CCGGGTTATT TGCTGCGTTA TCAGTGATGT TCTGGTGTTT TGTCGGTCTG GAGGCATTTG CCCATCTCGC CTCGGAATTT AAAAATCCAG AGCGTGATTT TCCTCGTGCT TTGATGATTG GTCTGCTGCT GGCAGGATTA GTCTACTGGG GCTGTACGGT AGTCGTCTTA CACTTCGACG CCTATGGTGA AAAAATGGCG GCGGCAGCAT CGCTTCCAAA AATTGTAGTG CAGTTGTTCG GTGTAGGAGC GTTATGGATT GCCTGCGTGA TTGGCTATCT GGCCTGCTTT GCCAGTCTCA ACATTTATAT ACAGAGCTTC GCCCGCCTGG TCTGGTCGCA GGCGCAACAT AATCCTGACC ACTACCTGGC ACGCCTCTCT TCTCGCCATA TCCCGAATAA TGCCCTCAAT GCGGTGCTCG GCTGCTGTGT GGTGAGCACT TTGGTGATTC ATGCTTTAGA GATCAATCTG GACGCTCTTA TTATTTATGC CAATGGCATC TTTATTATGA TTTATCTGTT ATGCATGCTG GCAGGCTGTA AATTATTGCA AGGACGTTAT CGACTACTGG CGGTGGTTGG CGGGCTGTTA TGCGTTCTGT TACTGGCAAT GGTCGGCTGG AAAAGTCTCT ATGCGCTGAT CATGCTGGCG GGGTTATGGC TGTTGCTGCC AAAACGAAAA ACGCCGGAAA ATGGCATAAC CACATAA
|
Protein sequence | MSGLKQELGL AQGIGLLSTS LLGTGVFAVP ALAALVAGNN SLWAWPVLII LVFPIAIVFA ILGRHYPSAG GVAHFVGMAF GSRLERVTGW LFLSVIPVGL PAALQIAAGF GQAMFGWHSW QLLLAELGTL ALVWYIGTRG ASSSANLQTV IAGLIVALIV AIWWAGDIKP ANIPFPAPGN IELTGLFAAL SVMFWCFVGL EAFAHLASEF KNPERDFPRA LMIGLLLAGL VYWGCTVVVL HFDAYGEKMA AAASLPKIVV QLFGVGALWI ACVIGYLACF ASLNIYIQSF ARLVWSQAQH NPDHYLARLS SRHIPNNALN AVLGCCVVST LVIHALEINL DALIIYANGI FIMIYLLCML AGCKLLQGRY RLLAVVGGLL CVLLLAMVGW KSLYALIMLA GLWLLLPKRK TPENGITT
|
| |