Gene EcDH1_0386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0386 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp422751 
End bp423947 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content51% 
IMG OID 
Productgeneral secretion pathway protein F 
Protein accessionACX38076 
Protein GI260447654 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATC GCTATCGCGC CATGACCCAG GATGGTCAAA AATTGCAAGG GATCATTGAT 
GCTAACGATG AACGTCAGGC ACGACTGCGG CTGCGTGAAG AAGGGCTTTT CCTGCTGGAT
ATTCGCCCCC AAAAAAGTTC GGGAGTAAAA ACACGTCGCC CGAGGATCAG CCATAGTGAA
CTGACGCTTT TCACCCGGCA GTTGGCAACC TTAAGCGCAG CGGCATTACC CCTGGAAGAG
AGCCTTGCCG TAATCGGTCA ACAAAGCAGT AATAAACGAC TGGGTGACGT GTTAAATCAG
GTACGCAGCG CCATCCTTGA AGGGCATCCC CTTTCCGATG CATTACAGCA TTTTCCCACG
CTTTTCGATT CGCTCTATCG TACCCTGGTA AAAGCGGGCG AAAAGAGCGG GCTGCTGGCC
CCGGTGTTGG AAAAGCTGGC TGATTACAAT GAAAACCGGC AGAAAATCCG CAGCAAGCTC
ATTCAGTCAC TGATCTACCC CTGTATGCTC ACTACGGTGG CGATTGGGGT CGTGATTATT
CTCCTCACTG CTGTCGTGCC CAAAATTACC GAACAGTTCG TGCATATGAA GCAGCAACTG
CCGCTGAGTA CACGCATTCT TTTAGGTCTG AGCGACACGT TGCAACGTAC CGGCCCGACA
TTATTAGCGA CAGTGTTTAT TGTCGCTGTA GGTTTCTGGC TCTGGTTAAA ACGCGGCAAT
AACCGCCACC GTTTTCATGC CATGTTGCTG CGCGTTGCGC TCATCGGCCC GCTGATTTGC
GCCATTAACA GCGCACGCTA TCTCCGCACT TTAAGTATTT TGCAATCCAG CGGCGTCCCT
CTGCTGGATG GGATGAATTT GTCCACCGAA AGCCTCAACA ACCTCGAAAT TCGCCAGCGT
CTGGCAAATG CGGCAGAGAA CGTTCGCCAG GGTAACAGCA TTCATCTTTC GCTGGAACAA
ACCGCAATTT TCCCGCCGAT GATGCTCTAC ATGGTGGCCT CTGGCGAAAA AAGCGGGCAG
CTCGGCACAT TAATGGTCAG AGCCGCAGAT AACCAGGAGA CACTCCAACA AAATCGGATC
GCCTTAACGC TCTCCATCTT CGAGCCAGCA CTCATTATTA CGATGGCACT GATCGTCCTG
TTTATTGTCG TGTCGGTACT CCAACCTCTT CTTCAACTTA ACTCAATGAT TAATTAA
 
Protein sequence
MNYRYRAMTQ DGQKLQGIID ANDERQARLR LREEGLFLLD IRPQKSSGVK TRRPRISHSE 
LTLFTRQLAT LSAAALPLEE SLAVIGQQSS NKRLGDVLNQ VRSAILEGHP LSDALQHFPT
LFDSLYRTLV KAGEKSGLLA PVLEKLADYN ENRQKIRSKL IQSLIYPCML TTVAIGVVII
LLTAVVPKIT EQFVHMKQQL PLSTRILLGL SDTLQRTGPT LLATVFIVAV GFWLWLKRGN
NRHRFHAMLL RVALIGPLIC AINSARYLRT LSILQSSGVP LLDGMNLSTE SLNNLEIRQR
LANAAENVRQ GNSIHLSLEQ TAIFPPMMLY MVASGEKSGQ LGTLMVRAAD NQETLQQNRI
ALTLSIFEPA LIITMALIVL FIVVSVLQPL LQLNSMIN