Gene EcDH1_1503 details

Gene Information

Locus tag: EcDH1_1503
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1630090
End bp: 1632081
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: ACX39173
Protein GI: 260448751
COG category:
COG ID:
TIGRFAM ID:
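The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function name `check_gene_record` is hypothetical and not part of any database API.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    counts the terminal stop codon (both are assumptions).
    """
    span = end_bp - start_bp + 1              # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # one codon is the stop, so it does not appear in the protein
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information fields of this record.
result = check_gene_record(1630090, 1632081, 1992, 663)
print(result)
```

With the values listed here, the span is 1632081 - 1630090 + 1 = 1992 bp, which gives 664 codons: 663 amino acids plus one stop codon.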


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000115095
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAGGT TGAACCCTTT CGTACGGGTC GGGCTGTGTT TGTCCGCTAT TTCTTGTGCA 
TGGCCTGTGT TAGCGGTCGA TGATGATGGC GAAACGATGG TTGTCACTGC ATCTTCCGTG
GAACAAAATC TTAAAGATGC ACCTGCCAGT ATCAGCGTCA TTACCCAGGA AGACCTGCAG
CGAAAACCGG TACAGAATCT GAAGGATGTC CTCAAAGAAG TGCCTGGCGT ACAACTGACG
AACGAAGGGG ATAACCGTAA GGGCGTTAGT ATTCGTGGTC TGGACAGCAG CTATACCCTG
ATTCTCGTCG ACGGTAAACG CGTGAACTCC CGCAATGCCG TCTTCCGCCA CAATGATTTC
GATCTGAACT GGATCCCGGT CGATTCCATC GAACGTATTG AAGTGGTCCG TGGCCCGATG
TCGTCGCTGT ACGGTTCCGA TGCGCTCGGC GGTGTAGTGA ATATCATCAC CAAAAAAATC
GGTCAGAAAT GGTCGGGTAC CGTTACCGTC GATACCACCA TTCAGGAACA TCGCGATCGC
GGTGACACCT ATAACGGTCA GTTCTTTACC AGTGGACCAT TAATTGATGG TGTGCTGGGA
ATGAAAGCTT ACGGCAGCCT GGCAAAACGT GAAAAGGATG ACCCGCAAAA CTCAACGACC
ACCGATACCG GAGAAACGCC GCGTATTGAA GGATTCTCCA GCCGCGACGG CAATGTCGAA
TTTGCCTGGA CACCGAATCA AAATCACGAT TTTACTGCCG GATACGGTTT CGACCGTCAG
GATCGTGATT CCGACTCGCT GGACAAAAAC CGCCTGGAAC GCCAGAACTA CTCCGTCAGC
CATAATGGGC GTTGGGATTA CGGCACCAGC GAACTGAAAT ACTACGGTGA GAAAGTCGAG
AACAAAAACC CTGGCAACAG CAGCCCGATA ACTTCCGAAA GCAATACGGT CGACGGCAAA
TACACGTTGC CGCTGACGGC GATTAATCAG TTTCTCACGG TTGGCGGTGA ATGGCGTCAC
GACAAACTTA GCGATGCGGT GAACCTGACC GGGGGAACCA GCTCCAAAAC GTCTGCCAGC
CAGTACGCGC TGTTTGTGGA AGATGAATGG CGGATCTTCG AGCCGCTGGC GCTGACGACC
GGCGTGCGTA TGGACGATCA CGAAACCTAC GGTGAACACT GGAGTCCGCG TGCCTACCTG
GTTTATAACG CCACCGACAC CGTAACGGTG AAAGGGGGCT GGGCGACGGC ATTTAAAGCA
CCTTCTCTGT TGCAACTTAG CCCTGACTGG ACGAGCAATT CCTGCCGTGG CGCATGTAAG
ATTGTGGGTA GCCCGGATCT GAAACCAGAA ACCAGCGAAA GTTGGGAGCT GGGGCTTTAC
TACATGGGTG AAGAAGGCTG GCTGGAAGGG GTTGAATCCA GCGTTACCGT TTTCCGTAAC
GATGTGAAAG ATCGTATCAG CATCAGCCGT ACGTCTGACG TCAACGCTGC ACCGGGCTAC
CAAAACTTTG TTGGTTTTGA GACGGGCGCT AACGGACGGC GCATACCGGT ATTTAGCTAC
TACAACGTTA ACAAAGCTCG TATTCAGGGC GTGGAAACCG AACTGAAAAT TCCGTTCAAC
GATGAATGGA AACTGTCGAT CAACTACACC TACAACGATG GTCGTGATGT CAGCAACGGC
GAAAACAAAC CGCTATCCGA TCTGCCGTTC CATACTGCTA ACGGTACGCT GGACTGGAAA
CCGCTGGCGC TGGAAGACTG GTCATTCTAT GTTTCTGGGC ACTATACCGG GCAGAAACGC
GCCGACAGCG CGACGGCTAA AACACCGGGC GGTTATACCA TCTGGAATAC CGGCGCGGCC
TGGCAGGTGA CTAAAGACGT CAAACTGCGC GCAGGCGTGC TGAACCTTGG CGACAAGGAT
CTCAGTCGTG ACGACTACAG CTATAACGAA GACGGACGTC GTTACTTTAT GGCAGTGGAT
TATCGCTTCT GA
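The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis such as recomputing the GC content field. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used for illustration.

```python
def clean_sequence(blocked):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record (illustration only).
first_line = "ATGTTTAGGT TGAACCCTTT CGTACGGGTC GGGCTGTGTT TGTCCGCTAT TTCTTGTGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq) * 100), "% GC")
```

Running `gc_fraction` on the full joined sequence should give a value comparable to the GC content field, assuming that field is computed over the gene sequence alone.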
 
Protein sequence
MFRLNPFVRV GLCLSAISCA WPVLAVDDDG ETMVVTASSV EQNLKDAPAS ISVITQEDLQ 
RKPVQNLKDV LKEVPGVQLT NEGDNRKGVS IRGLDSSYTL ILVDGKRVNS RNAVFRHNDF
DLNWIPVDSI ERIEVVRGPM SSLYGSDALG GVVNIITKKI GQKWSGTVTV DTTIQEHRDR
GDTYNGQFFT SGPLIDGVLG MKAYGSLAKR EKDDPQNSTT TDTGETPRIE GFSSRDGNVE
FAWTPNQNHD FTAGYGFDRQ DRDSDSLDKN RLERQNYSVS HNGRWDYGTS ELKYYGEKVE
NKNPGNSSPI TSESNTVDGK YTLPLTAINQ FLTVGGEWRH DKLSDAVNLT GGTSSKTSAS
QYALFVEDEW RIFEPLALTT GVRMDDHETY GEHWSPRAYL VYNATDTVTV KGGWATAFKA
PSLLQLSPDW TSNSCRGACK IVGSPDLKPE TSESWELGLY YMGEEGWLEG VESSVTVFRN
DVKDRISISR TSDVNAAPGY QNFVGFETGA NGRRIPVFSY YNVNKARIQG VETELKIPFN
DEWKLSINYT YNDGRDVSNG ENKPLSDLPF HTANGTLDWK PLALEDWSFY VSGHYTGQKR
ADSATAKTPG GYTIWNTGAA WQVTKDVKLR AGVLNLGDKD LSRDDYSYNE DGRRYFMAVD
YRF
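The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own procedure: it builds the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (not needed here, since this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()   # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
head = translate("ATGTTTAGGT TGAACCCTTT CGTACGGGTC")
tail = translate("TATCGCTTCT GA")
print(head, tail)  # prints "MFRLNPFVRV YRF"
```

The two fragments match the first ten and last three residues of the protein sequence listed above, and the final TGA codon is the stop.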