Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0388 |
Symbol | |
ID | 6968679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 395635 |
End bp | 396585 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384440 |
Product | putative carbamate kinase |
Protein accession | YP_002268955 |
Protein GI | 209395980 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0549] Carbamate kinase |
TIGRFAM ID | [TIGR00746] carbamate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.662634 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAC TTGTGGTCGT TGCCATTGGT GGCAACAGCA TTATCAAAGA TAACGCCAGC CAGTCGATTG AGCATCAGGC GGAGGCGGTG AAAGCCGTGG CCGATACGGT GCTGGAAATG CTGGCTTCCG ATTACGACAT TGTGCTGACC CACGGTAACG GGCCGCAGGT CGGGCTGGAT TTACGCCGTG CGGAGATTGC CCACGAGCGC GAAGGGCTGC CCTTAACGCC GCTGGCGAAC TGTGTGGCAG ATACGCAGGG CGGCATTGGC TACCTGATCC AACAGGCGCT GAACAACCGG CTGGCGCGTC ACGGCGAGAA AAAAGCCGTC ACCGTGGTGA CTCAGGTGGA AGTGGATAAA AACGATCCAG GTTTTGCCCA TCCCACCAAG CCCATTGGTG CATTCTTCAG TGAAAGCCAG CGTGACGAAT TACAAAAGGC AAACCCTGAC TGGCGTTTTG TTGAAGATGC TGGACGGGGC TATCGCCGCG TGGTCGCCTC GCCGGAACCG AAACGTATTG TCGAAGCACC CGCCATTAAG GCGCTGATCC AACAAGGCTT TGTGGTGATT GGCGCGGGCG GCGGCGGAAT TCCTGTGGTG CGTACTGACG CGGGAGATTA CCAAAGCGTG GACGCGGTTA TCGACAAAGA TCTCTCTACC GCGCTGCTGG CCCGTGAAAT TCACGCCGAC ATTCTGGTGA TCACCACCGG CGTCGAAAAA GTCTGTATCC ACTTTGGTAA ACCCGAGAAG CAGGCGCTGG ATCGGGTGGA TATTGCCACC ATGACTCGCT ATATGCAGGA AGGACATTTC CCGCCCGGCA GCATGTTGCC AAAAATCATC GCCAGCCTGA CGTTCCTGGA ACAGGGCGGT AAAGAAGTGA TTATCACCAC GCCGGAATGC CTGCCAGCGG CGCTGCGCGG TGAAACGGGC ACCCACATTA TTAAAACGTA A
|
Protein sequence | MKELVVVAIG GNSIIKDNAS QSIEHQAEAV KAVADTVLEM LASDYDIVLT HGNGPQVGLD LRRAEIAHER EGLPLTPLAN CVADTQGGIG YLIQQALNNR LARHGEKKAV TVVTQVEVDK NDPGFAHPTK PIGAFFSESQ RDELQKANPD WRFVEDAGRG YRRVVASPEP KRIVEAPAIK ALIQQGFVVI GAGGGGIPVV RTDAGDYQSV DAVIDKDLST ALLAREIHAD ILVITTGVEK VCIHFGKPEK QALDRVDIAT MTRYMQEGHF PPGSMLPKII ASLTFLEQGG KEVIITTPEC LPAALRGETG THIIKT
|
| |