Gene Information |
Locus tag | EcDH1_2426 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 2603300 |
End bp | 2604694 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | putative invasin |
Protein accession | ACX40066 |
Protein GI | 260449644 |
COG category | |
COG ID | |
TIGRFAM ID | |
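The coordinate and length fields above agree with one another if the start and end positions are read as inclusive, 1-based coordinates. The Python sketch below checks that arithmetic. The variable names are illustrative and not part of the record, and the sketch assumes the reported protein length excludes the terminal stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 2603300
end_bp = 2604694

# Inclusive coordinates: both the start and end positions belong to the gene.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; assume the final codon is a stop and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1395
print(protein_length)  # 464
```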
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0439788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCCGTT TCGTTCCTCG CATTATCCCG TTTTATTTAC TCTTGCTTGT AGCAGGCGGT ACAGCTAACG CACAATCTAC CTTCGAGCAA AAAGCGGCAA ATCCCTTTGA TAATAACAAT GATGGTCTGC CGGATTTAGG CATGGCTCCC GAAAATCATG ATGGGGAAAA ACACTTTGCT GAAATTGTGA AAGATTTCGG CGAAACCAGT ATGAATGATA ACGGGCTGGA TACTGGCGAG CAGGCAAAAG CTTTCGCATT GGGAAAAGTC CGCGACGCGC TTAGTCAACA GGTTAATCAG CACGTAGAGT CCTGGCTATC ACCGTGGGGA AATGCCAGTG TTGACGTCAA AGTGGATAAC GAAGGACATT TCACCGGCAG TCGTGGAAGC TGGTTTGTGC CGTTACAAGA TAATGATCGT TATCTCACCT GGAGCCAGCT TGGTCTTACT CAGCAGGATA ATGGGTTGGT GAGCAATGTG GGCGTTGGGC AACGCTGGGC GCGCGGCAAC TGGCTGGTGG GTTATAACAC TTTTTATGAC AACTTGCTGG ACGAAAATCT TCAGCGAGCG GGCTTTGGTG CCGAAGCGTG GGGCGAATAT TTGCGATTAT CGGCAAACTT TTATCAGCCG TTTGCTGCAT GGCATGAACA GACAGCCACG CAGGAACAAC GGATGGCGCG CGGGTACGAC CTGACAGCTC GGATGCGCAT GCCGTTCTAT CAACACCTCA ATACCAGTGT CAGCCTAGAA CAGTATTTTG GTGATCGTGT TGATTTGTTT AACTCTGGTA CGGGTTATCA CAATCCCGTC GCGTTGAGTC TGGGATTAAA TTACACCCCT GTGCCATTAG TCACTGTGAC GGCCCAGCAT AAACAGGGTG AAAGTGGCGA AAATCAAAAT AACCTCGGGC TGAATCTTAA TTACCGCTTT GGTGTACCGC TCAAAAAACA ACTTTCTGCG GGCGAGGTTG CCGAAAGTCA GTCGTTACGT GGTAGTCGCT ATGACAATCC GCAGCGAAAT AATCTACCGA CTCTTGAGTA CCGACAGCGA AAAACGTTAA CGGTGTTTCT GGCGACACCG CCGTGGGATC TAAAACCTGG CGAAACAGTG CCGCTGAAAT TACAAATCCG CAGTCGTTAC GGTATTCGGC AACTGATTTG GCAGGGCGAT ACGCAGATAT TAAGTTTGAC GCCGGGCGCA CAAGCCAACA GCGCGGAGGG CTGGACGCTG ATCATGCCTG ACTGGCAGAA CGGGGAAGGG GCAAGCAATC ACTGGCGATT GTCTGTGGTG GTGGAAGATA ACCAGGGGCA GCGTGTCTCC TCCAATGAGA TCACGCTAAC GCTTGTCGAA CCGTTCGATG CATTGTCAAA CGACGAACTG CGCTGGGAAC CGTAA
|
Protein sequence | MSRFVPRIIP FYLLLLVAGG TANAQSTFEQ KAANPFDNNN DGLPDLGMAP ENHDGEKHFA EIVKDFGETS MNDNGLDTGE QAKAFALGKV RDALSQQVNQ HVESWLSPWG NASVDVKVDN EGHFTGSRGS WFVPLQDNDR YLTWSQLGLT QQDNGLVSNV GVGQRWARGN WLVGYNTFYD NLLDENLQRA GFGAEAWGEY LRLSANFYQP FAAWHEQTAT QEQRMARGYD LTARMRMPFY QHLNTSVSLE QYFGDRVDLF NSGTGYHNPV ALSLGLNYTP VPLVTVTAQH KQGESGENQN NLGLNLNYRF GVPLKKQLSA GEVAESQSLR GSRYDNPQRN NLPTLEYRQR KTLTVFLATP PWDLKPGETV PLKLQIRSRY GIRQLIWQGD TQILSLTPGA QANSAEGWTL IMPDWQNGEG ASNHWRLSVV VEDNQGQRVS SNEITLTLVE PFDALSNDEL RWEP
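As a sanity check on the two sequences above, the following Python sketch translates the first and last codons of the gene sequence using the amino acid assignments of translation table 11. It assumes the gene sequence is shown in coding orientation (it begins with TTG and ends with TAA, even though the gene lies on the minus strand) and that an initiator codon such as TTG is read as methionine. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical helpers, not part of the record.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order (third base varies fastest).
# Table 11 uses the same assignments as the standard code; it differs in the
# set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for table 11 (assumption: taken from the NCBI table).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, first_is_start=False):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiator codon is read as methionine regardless of its usual meaning.
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bp and last 15 bp of the gene sequence in the record.
head = translate("TTGAGCCGTT TCGTTCCTCG CATTATCCCG", first_is_start=True)
tail = translate("CGCTGGGAAC CGTAA")

print(head)  # MSRFVPRIIP
print(tail)  # RWEP
```

Both results match the beginning and end of the protein sequence in the record. Passing the full gene sequence with `first_is_start=True` would be the natural way to compare against the whole 464 aa protein.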