Gene EcDH1_2141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2141 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2288477 
End bp2289625 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content45% 
IMG OID 
Productfimbrial biogenesis outer membrane usher protein 
Protein accessionACX39794 
Protein GI260449372 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0000830223 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGTT ACACCGTCAA GCCTCCTACC GGAGACACCA ATGAGCAGAC ACAATTTATT 
GATTATTTTA ATCTGTTCTA CAGTAAGCGT GGTCAGGAAC AAATAAGCAT CTCTCAGCAG
CTTGGAAATT ACGGTACGAC ATTTTTCAGT GCCAGTCGCC AAAGTTACTG GAACACGTCA
CGCAGCGACC AGCAAATATC ATTTGGATTA AATGTGCCGT TTGGTGATAT TACGACTTCG
CTGAATTACA GCTATTCCAA TAATATATGG CAAAACGATC GGGATCATTT ACTCGCTTTT
ACGCTTAATG TTCCCTTCAG TCATTGGATG CGTACAGACA GTCAGTCGGC ATTTCGTAAT
TCAAACGCCA GTTACAGTAT GTCAAACGAT TTGAAAGGCG GCATGACCAA TCTATCGGGG
GTTTATGGCA CTCTGCTGCC GGATAATAAC CTGAATTATA GCGTTCAGGT CGGTAACACC
CACGGAGGTA ATACATCGTC TGGCACCAGT GGTTACAGTT CTCTTAATTA TCGTGGAGCT
TATGGTAATA CTAATGTCGG TTACAGTCGG AGTGGTGACA GCAGCCAGAT TTATTACGGA
ATGAGTGGTG GGATTATTGC TCATGCTGAT GGCATCACCT TTGGACAGCC GCTGGGCGAC
ACAATGGTTC TGGTTAAGGC TCCTGGTGCT GATAATGTCA AAATAGAGAA CCAGACCGGA
ATTCATACCG ACTGGCGTGG CTATGCCATA TTACCATTTG CGACAGAATA TAGAGAAAAC
CGTGTTGCTC TTAACGCGAA TTCCCTTGCA GATAATGTTG AACTGGATGA AACCGTGGTC
ACTGTCATCC CAACTCACGG TGCTATTGCC AGAGCAACAT TTAATGCACA AATCGGCGGG
AAAGTATTAA TGACGTTGAA GTACGGTAAT AAGAGCGTTC CATTCGGTGC AATTGTCACA
CACGGAGAGA ATAAAAATGG CAGCATTGTC GCGGAAAATG GTCAGGTTTA TCTGACTGGA
CTTCCACAGT CAGGGCAATT ACAGGTTTCA TGGGGCAAAG ATAAAAACTC AAACTGTATT
GTCGAGTACA AGCTTCCTGA AGTTTCTCCT GGTACCTTAC TGAACCAGCA GACAGCAATC
TGTCGCTAA
 
Protein sequence
MSGYTVKPPT GDTNEQTQFI DYFNLFYSKR GQEQISISQQ LGNYGTTFFS ASRQSYWNTS 
RSDQQISFGL NVPFGDITTS LNYSYSNNIW QNDRDHLLAF TLNVPFSHWM RTDSQSAFRN
SNASYSMSND LKGGMTNLSG VYGTLLPDNN LNYSVQVGNT HGGNTSSGTS GYSSLNYRGA
YGNTNVGYSR SGDSSQIYYG MSGGIIAHAD GITFGQPLGD TMVLVKAPGA DNVKIENQTG
IHTDWRGYAI LPFATEYREN RVALNANSLA DNVELDETVV TVIPTHGAIA RATFNAQIGG
KVLMTLKYGN KSVPFGAIVT HGENKNGSIV AENGQVYLTG LPQSGQLQVS WGKDKNSNCI
VEYKLPEVSP GTLLNQQTAI CR