Gene EcDH1_2195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2195 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2355413 
End bp2357515 
Gene Length2103 bp 
Protein Length700 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionACX39845 
Protein GI260449423 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0192314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATTT TTTCCGTCCG ACAGACCGTT TTGCCCGCAC TGCTTGTCCT TTCCCCCGTT 
GTTTTTGCCG CTGATGAACA GACTATGATT GTCAGTGCCG CACCGCAGGT GGTTTCAGAA
CTGGATACCC CAGCAGCAGT AAGCGTGGTG GATGGCGAGG AGATGCGCCT GGCAACACCG
CGCATTAACT TGTCCGAATC ACTGACCGGC GTGCCTGGTT TGCAGGTACA AAACCGGCAG
AACTATGCGC AAGATTTACA GCTGTCGATT CGCGGATTTG GCTCCCGCTC CACTTACGGT
ATTCGCGGTA TTCGCCTGTA TGTGGACGGT ATTCCCGCCA CCATGCCCGA CGGGCAAGGG
CAAACATCCA ACATCGATTT AAGCAGTGTG CAAAATGTGG AAGTGCTGCG TGGCCCCTTC
TCTGCCCTGT ATGGCAACGC GTCTGGTGGG GTAATGAATG TCACCACCCA GACCGGACAA
CAGCCACCAA CCATTGAAGC CAGTAGTTAC TACGGCAGTT TTGGCAGCTG GCGCTATGGG
CTGAAAGCAA CGGGCGCAAC GGGAGACGGC ACACAGCCTG GCGATGTCGA TTACACCGTC
TCAACCACGC GTTTTACGAC CCACGGCTAT CGTGACCATA GTGGCGCACA GAAAAATTTA
GCCAATGCCA AACTGGGCGT ACGCATTGAT GAAGCCAGCA AATTAAGTCT GATTTTCAAT
AGTGTGGATA TCAAAGCAGA TGACCCAGGT GGGCTAACCA AAGCAGAATG GAAGGCTAAT
CCACAACAAG CGCCTCGTGC AGAACAGTAC GACACGCGAA AAACCATCAA GCAAACTCAG
GCTGGGTTGC GCTATGAGCG TAGCCTGAGT TCGCGGGATG ATATGAGTGT GATGATGTAT
GCCGGAGAGC GAGAAACGAC CCAGTACCAG TCAATACCCA TGGCACCACA ACTTAACCCG
TCACATGCGG GCGGCGTGAT TACCCTGCAA CGCCATTACC AGGGAATAGA CAGCCGCTGG
ACACACCGTG GTGAACTGGG CGTTCCGGTC ACGTTCACTA CCGGCCTGAA CTACGAAAAC
ATGAGTGAAA ACCGCAAGGG CTACAATAAC TTCCGCCTGA ATAGCGGCAT GCCGGAGTAC
GGGCAAAAAG GTGAGTTGCG TCGCGACGAA CGCAATCTGA TGTGGAACAT CGATCCCTAT
TTACAGACGC AGTGGCAGCT GAGCGAAAAA CTGTCGCTGG ATGCTGGCGT GCGCTACAGC
TCCGTGTGGT TTGATTCCAA CGACCATTAC GTTACTCCGG GTAACGGCGA TGACAGCGGT
GATGCCAGTT ATCATAAATG GCTACCTGCC GGTTCGTTAA AATATGCAAT GACCGATGCC
TGGAATATCT ATCTGGCAGC CGGGCGAGGT TTTGAAACGC CGACGATTAA TGAGCTGTCT
TATCGTGCTG ATGGGCAAAG CGGTATGAAC TTAGGTTTAA AACCATCCAC CAACGATACA
ATTGAGATCG GCAGTAAAAC GCGTATTGGT GATGGGCTGC TTAGTCTCGC ATTGTTTCAG
ACCGACACTG ATGATGAAAT TGTTGTCGAT AGCAGTAGCG GTGGGCGTAC GACTTACAAA
AATGCCGGAA AGACCCGTCG TCAAGGCGCT GAACTGGCAT GGGATCAACG TTTCGCAGGA
GATTTTCGCG TAAACGCGTC CTGGACCTGG CTTGATGCGA CCTATCGCAG CAATGTTTGC
AATGAACAGG ATTGTAACGG TAATCGGATG CCAGGGATCG CCCGTAATAT GGGCTTTGCG
TCGATAGGTT ATGTACCGGA AGATGGTTGG TATGCAGGCA CGGAAGCGCG TTATATGGGC
GATATTATGG CAGATGATGA AAATACGGCA AAAGCGCCGT CTTATACTCT CGTCGGCTTA
TTCACCGGGT ATAAATACAA TTACCACAAT TTAACTGTGG ATTTATTTGG TCGTGTCGAT
AATTTATTCG ATAAAGAATA CGTTGGTTCT GTCATTGTCA ATGAGTCAAA CGGGCGATAT
TACGAACCTT CGCCCGGACG AAATTATGGT GTCGGCATGA ATATTGCGTG GAGATTTGAG
TAA
 
Protein sequence
MKIFSVRQTV LPALLVLSPV VFAADEQTMI VSAAPQVVSE LDTPAAVSVV DGEEMRLATP 
RINLSESLTG VPGLQVQNRQ NYAQDLQLSI RGFGSRSTYG IRGIRLYVDG IPATMPDGQG
QTSNIDLSSV QNVEVLRGPF SALYGNASGG VMNVTTQTGQ QPPTIEASSY YGSFGSWRYG
LKATGATGDG TQPGDVDYTV STTRFTTHGY RDHSGAQKNL ANAKLGVRID EASKLSLIFN
SVDIKADDPG GLTKAEWKAN PQQAPRAEQY DTRKTIKQTQ AGLRYERSLS SRDDMSVMMY
AGERETTQYQ SIPMAPQLNP SHAGGVITLQ RHYQGIDSRW THRGELGVPV TFTTGLNYEN
MSENRKGYNN FRLNSGMPEY GQKGELRRDE RNLMWNIDPY LQTQWQLSEK LSLDAGVRYS
SVWFDSNDHY VTPGNGDDSG DASYHKWLPA GSLKYAMTDA WNIYLAAGRG FETPTINELS
YRADGQSGMN LGLKPSTNDT IEIGSKTRIG DGLLSLALFQ TDTDDEIVVD SSSGGRTTYK
NAGKTRRQGA ELAWDQRFAG DFRVNASWTW LDATYRSNVC NEQDCNGNRM PGIARNMGFA
SIGYVPEDGW YAGTEARYMG DIMADDENTA KAPSYTLVGL FTGYKYNYHN LTVDLFGRVD
NLFDKEYVGS VIVNESNGRY YEPSPGRNYG VGMNIAWRFE