Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0353 |
Symbol | |
ID | 6146364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 365085 |
End bp | 366035 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615249 |
Product | putative carbamate kinase |
Protein accession | YP_001742457 |
Protein GI | 170682847 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0549] Carbamate kinase |
TIGRFAM ID | [TIGR00746] carbamate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAC TTGTGGTCGT TGCCATTGGT GGCAACAGCA TTATCAAAGA TAACGCCAGC CAGTCGATTG AGCATCAGGC GGAGGCGGTG AAGGCCGTCG CCGATACGGT GCTGGAAATG CTGGCTTCCG ATTACGACAT TGTGCTGACC CACGGCAACG GACCGCAGGT CGGTTTAGAT TTACGCCGCG CGGAGATTGC CCACGAGCGC GAAGGGCTGC CCTTAACGCC GCTGGCGAAC TGTGTGGCGG ATACGCAAGG CGGCATTGGC TATCTGATCC AGCAGGCGCT GAATAACCGG CTGGCGCGTC ACGGCGAGAA AAAAGCCGTC ACCGTGGTGA CTCAGGTGGA AGTGGATAAA AACGATCCGG GGTTTGCCCA TCCCACCAAG CCCATCGGCG CATTCTTTAG TGAAAGCCAG CGTGACGAAC TACAAAAGGC AAACCCTGAC TGGCGTTTTG TTGAAGATGC CGGACGGGGC TATCGCCGCG TGGTCGCCTC GCCGGAACCG AAACGTATTG TCGAAGCACC TGCCATTAAG GCGCTGATCC AACAAGGTTT TGTGGTGATT GGCGCGGGCG GCGGTGGAAT TCCGGTAGTG CGTACTGAAG CGGGGGATTA CCAAAGCGTG GACGCGGTTA TCGACAAAGA TCTCTCTACC GCGCTATTGG CCCGTGAAAT TCACGCCGAC ATTCTGGTGA TCACCACTGG CGTGGAAAAA GTGTGTATTC ACTTTGGCAA ACCGCAGCAG CAGGTGCTGG ATCGGGTGGA TATTGCCACC ATGACCCGCT ATATGCAGGA AGGGCATTTC CCACCCGGCA GCATGTTGCC AAAAATCATC GCCAGCCTGA CGTTCCTGGA GCAGGGCGGT AAAGAAGTGA TTATCACCAC GCCGGAATGC CTGCCTGCGG CGCTACGCGG CGAAACGGGC ACCCATATTA TTAAAACCTA A
|
Protein sequence | MKELVVVAIG GNSIIKDNAS QSIEHQAEAV KAVADTVLEM LASDYDIVLT HGNGPQVGLD LRRAEIAHER EGLPLTPLAN CVADTQGGIG YLIQQALNNR LARHGEKKAV TVVTQVEVDK NDPGFAHPTK PIGAFFSESQ RDELQKANPD WRFVEDAGRG YRRVVASPEP KRIVEAPAIK ALIQQGFVVI GAGGGGIPVV RTEAGDYQSV DAVIDKDLST ALLAREIHAD ILVITTGVEK VCIHFGKPQQ QVLDRVDIAT MTRYMQEGHF PPGSMLPKII ASLTFLEQGG KEVIITTPEC LPAALRGETG THIIKT
|
| |