Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1792 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 1945220 |
End bp | 1946398 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | phosphoribosylglycinamide formyltransferase 2 |
Protein accession | ACX39451 |
Protein GI | 260449029 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00174349 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTTAT TAGGCACTGC GCTGCGTCCG GCAGCAACTC GCGTGATGTT ATTAGGCTCC GGTGAACTGG GTAAAGAAGT GGCAATCGAG TGTCAGCGTC TCGGCGTAGA GGTGATTGCC GTCGATCGCT ATGCCGACGC ACCAGCCATG CATGTCGCGC ATCGCTCCCA TGTCATTAAT ATGCTTGATG GTGATGCATT ACGCCGTGTG GTTGAACTGG AAAAACCACA TTATATCGTG CCGGAGATCG AAGCTATTGC CACCGATATG CTGATCCAAC TTGAAGAGGA AGGACTGAAT GTTGTCCCCT GCGCTCGCGC AACGAAATTA ACGATGAATC GCGAGGGTAT CCGTCGCCTG GCGGCAGAAG AGCTGCAGCT GCCCACTTCC ACTTATCGTT TTGCCGATAG CGAAAGCCTT TTCCGCGAGG CGGTTGCTGA CATTGGCTAT CCCTGCATTG TAAAACCGGT GATGAGCTCT TCCGGCAAGG GGCAGACGTT TATTCGTTCT GCAGAGCAAC TTGCTCAGGC ATGGAAGTAC GCTCAGCAAG GCGGTCGCGC CGGAGCGGGC CGCGTAATTG TTGAAGGCGT CGTTAAGTTT GACTTCGAAA TTACCCTGCT AACCGTCAGC GCGGTGGATG GCGTCCATTT CTGTGCACCA GTAGGTCATC GCCAGGAAGA TGGCGACTAC CGTGAATCCT GGCAACCACA GCAAATGAGC CCGCTTGCCC TTGAACGTGC GCAGGAGATT GCCCGTAAAG TGGTGCTGGC ACTGGGCGGT TATGGGTTGT TTGGTGTCGA GCTATTTGTC TGTGGTGATG AGGTGATTTT CAGTGAGGTC TCCCCTCGTC CACATGATAC CGGGATGGTG ACGTTAATTT CTCAAGATCT CTCAGAGTTT GCCCTGCATG TACGTGCCTT CCTCGGACTT CCGGTTGGCG GGATCCGTCA GTATGGTCCT GCAGCTTCTG CCGTTATTCT GCCACAACTG ACCAGTCAGA ATGTCACGTT TGATAATGTG CAGAATGCCG TAGGCGCAGA TTTGCAGATT CGTTTATTTG GTAAGCCGGA AATTGATGGC AGCCGTCGTC TGGGGGTGGC ACTGGCTACT GCAGAGAGTG TTGTTGACGC CATTGAACGC GCGAAGCACG CCGCCGGACA GGTAAAAGTA CAGGGTTAA
|
Protein sequence | MTLLGTALRP AATRVMLLGS GELGKEVAIE CQRLGVEVIA VDRYADAPAM HVAHRSHVIN MLDGDALRRV VELEKPHYIV PEIEAIATDM LIQLEEEGLN VVPCARATKL TMNREGIRRL AAEELQLPTS TYRFADSESL FREAVADIGY PCIVKPVMSS SGKGQTFIRS AEQLAQAWKY AQQGGRAGAG RVIVEGVVKF DFEITLLTVS AVDGVHFCAP VGHRQEDGDY RESWQPQQMS PLALERAQEI ARKVVLALGG YGLFGVELFV CGDEVIFSEV SPRPHDTGMV TLISQDLSEF ALHVRAFLGL PVGGIRQYGP AASAVILPQL TSQNVTFDNV QNAVGADLQI RLFGKPEIDG SRRLGVALAT AESVVDAIER AKHAAGQVKV QG
|
| |