Gene EcDH1_3160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3160 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3398538 
End bp3400319 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content56% 
IMG OID 
ProductABC transporter related protein 
Protein accessionACX40786 
Protein GI260450364 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAGTT TTAGCCAACT GTGGCCGACT CTCAAGCGCC TGTTAGCGTA CGGTTCGCCG 
TGGCGTAAAC CGCTGGGGAT TGCGGTCCTG ATGATGTGGG TTGCGGCGGC GGCAGAAGTC
AGTGGGCCGC TGCTTATCAG CTATTTTATC GACAATATGG TAGCGAAAAA TAACCTGCCG
TTGAAAGTGG TTGCAGGGCT GGCTGCGGCG TATGTTGGGC TGCAACTGTT TGCCGCCGGG
CTACATTACG CGCAGTCGCT GCTGTTTAAT CGGGCGGCAG TAGGCGTAGT GCAACAGTTG
CGTACCGACG TGATGGATGC TGCGTTACGC CAGCCATTAA GCGAGTTTGA TACCCAACCC
GTCGGGCAGG TGATTTCCCG CGTCACTAAT GACACTGAAG TGATCCGCGA TCTCTACGTT
ACCGTAGTGG CAACTGTCCT GCGCAGTGCC GCGCTGGTGG GCGCGATGCT GGTGGCGATG
TTCAGCCTCG ACTGGCGAAT GGCACTGGTG GCGATAATGA TTTTCCCGGT GGTGCTGGTG
GTAATGGTGA TATACCAGCG TTACAGCACG CCGATTGTCC GTCGTGTGCG CGCCTATTTG
GCGGATATCA ACGACGGCTT TAACGAAATC ATCAATGGCA TGAGCGTTAT CCAGCAGTTT
CGTCAGCAGG CGCGATTTGG CGAACGTATG GGGGAGGCCA GTCGTTCACA CTATATGGCG
AGGATGCAAA CCCTGCGCCT CGACGGTTTT CTGCTGCGTC CGCTGCTGAG TCTGTTTTCA
TCGCTCATTC TTTGTGGCTT GTTGATGCTG TTTGGCTTCT CCGCCAGCGG CACCATTGAA
GTGGGCGTGC TGTATGCGTT TATCAGCTAT CTTGGGCGAC TTAACGAACC ATTAATCGAA
CTGACCACGC AACAGGCGAT GCTGCAACAG GCTGTTGTTG CTGGTGAGCG CGTGTTTGAA
CTGATGGACG GACCGCGCCA GCAATATGGC AATGATGATC GCCCGTTACA GAGTGGCACC
ATCGAAGTCG ATAACGTGTC ATTTGCTTAT CGCGATGACA ATCTGGTGCT AAAGAACATT
AATCTCTCTG TGCCTTCGCG CAATTTTGTG GCGCTGGTCG GGCATACCGG CAGTGGCAAA
AGCACCCTCG CCAGTTTATT GATGGGCTAT TACCCGCTAA CGGAAGGTGA GATTCGCCTT
GATGGTCGTC CATTAAGTTC GCTAAGTCAC AGCGCGCTGC GCCAGGGCGT GGCAATGGTG
CAGCAAGATC CGGTGGTGCT GGCGGATACC TTCCTCGCCA ACGTGACGCT GGGGCGGGAT
ATCTCCGAAG AACGCGTCTG GCAGGCGCTG GAAACCGTGC AACTGGCGGA GCTGGCGCGT
AGCATGAGCG ACGGTATTTA CACGCCGCTG GGCGAGCAGG GGAATAATCT CTCAGTTGGG
CAAAAGCAAC TGCTGGCACT GGCGCGCGTG CTGGTCGAGA CGCCGCAAAT CCTGATCCTT
GATGAGGCAA CCGCCAGCAT TGACTCCGGT ACTGAACAGG CGATTCAACA TGCTCTGGCG
GCGGTGCGTG AACATACCAC GCTGGTAGTG ATTGCTCACC GCTTATCGAC CATTGTTGAT
GCCGACACCA TTCTGGTGCT TCATCGTGGG CAAGCCGTGG AGCAGGGCAC TCACCAGCAA
CTGCTGGCGG CCCAGGGACG CTACTGGCAG ATGTATCAAC TGCAACTTGC GGGCGAAGAG
CTGGCAGCCA GCGTGCGTGA AGAGGAATCA TTGAGCGCCT GA
 
Protein sequence
MRSFSQLWPT LKRLLAYGSP WRKPLGIAVL MMWVAAAAEV SGPLLISYFI DNMVAKNNLP 
LKVVAGLAAA YVGLQLFAAG LHYAQSLLFN RAAVGVVQQL RTDVMDAALR QPLSEFDTQP
VGQVISRVTN DTEVIRDLYV TVVATVLRSA ALVGAMLVAM FSLDWRMALV AIMIFPVVLV
VMVIYQRYST PIVRRVRAYL ADINDGFNEI INGMSVIQQF RQQARFGERM GEASRSHYMA
RMQTLRLDGF LLRPLLSLFS SLILCGLLML FGFSASGTIE VGVLYAFISY LGRLNEPLIE
LTTQQAMLQQ AVVAGERVFE LMDGPRQQYG NDDRPLQSGT IEVDNVSFAY RDDNLVLKNI
NLSVPSRNFV ALVGHTGSGK STLASLLMGY YPLTEGEIRL DGRPLSSLSH SALRQGVAMV
QQDPVVLADT FLANVTLGRD ISEERVWQAL ETVQLAELAR SMSDGIYTPL GEQGNNLSVG
QKQLLALARV LVETPQILIL DEATASIDSG TEQAIQHALA AVREHTTLVV IAHRLSTIVD
ADTILVLHRG QAVEQGTHQQ LLAAQGRYWQ MYQLQLAGEE LAASVREEES LSA