Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4726 |
Symbol | pyrB |
ID | 6146493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4826680 |
End bp | 4827615 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619542 |
Product | aspartate carbamoyltransferase catalytic subunit |
Protein accession | YP_001746650 |
Protein GI | 170680939 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0540] Aspartate carbamoyltransferase, catalytic chain |
TIGRFAM ID | [TIGR00670] aspartate carbamoyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAATC CGCTATATCA GAAACATATC ATTTCCATAA ACGACCTTAG TCGCGATGAC CTTAATCTGG TGCTGGCGAC AGCGGCGAAA CTGAAAGCAA ACCCGCAACC AGAGCTGTTG AAGCACAAAG TCATTGCCAG CTGCTTCTTC GAAGCCTCTA CCCGTACCCG CCTCTCTTTC GAAACTTCCA TGCACCGCCT GGGTGCCAGC GTGGTGGGCT TCTCCGACAG CGCCAATACA TCACTGGGTA AAAAAGGCGA AACGCTGGCC GATACCATTT CGGTTATCAG CACTTACGTC GATGCGATAG TGATGCGTCA TCCGCAGGAA GGTGCGGCGC GCCTGGCCAC CGAATTTTCC GGCAATGTAC CGGTACTGAA TGCCGGTGAT GGCTCCAACC AACATCCGAC GCAAACCTTG CTGGACTTAT TCACCATTCA GGAAACCCAG GGGCGTCTGG ACAATCTCCA CGTCGCAATG GTTGGTGACC TGAAATATGG CCGCACCGTT CACTCCCTGA CCCAGGCGCT AGCGAAGTTC GACGGCAACC GTTTTTACTT TATCGCGCCG GATGCGCTGG CAATGCCGCA ATACATTCTG GATATGCTCG ATGAAAAAGG GATCGCCTGG AGTCTGCACA GCTCTATTGA AGAAGTGATG GCGGAAGTAG ACATCCTGTA CATGACCCGC GTGCAAAAAG AGCGTCTGGA CCCGTCCGAG TACGCCAACG TGAAAGCGCA GTTTGTTCTT CGCGCCAGCG ATCTCCACAA CGCCAAAGCC AATATGAAAG TGCTGCATCC GCTGCCGCGT GTTGATGAGA TTGCCACCGA TGTTGATAAA ACGCCGCACG CCTGGTACTT CCAGCAGGCA GGCAACGGGA TTTTCGCTCG CCAGGCGTTA CTGGCACTGG TTCTGAATCG CGATCTGGTA CTGTAA
|
Protein sequence | MANPLYQKHI ISINDLSRDD LNLVLATAAK LKANPQPELL KHKVIASCFF EASTRTRLSF ETSMHRLGAS VVGFSDSANT SLGKKGETLA DTISVISTYV DAIVMRHPQE GAARLATEFS GNVPVLNAGD GSNQHPTQTL LDLFTIQETQ GRLDNLHVAM VGDLKYGRTV HSLTQALAKF DGNRFYFIAP DALAMPQYIL DMLDEKGIAW SLHSSIEEVM AEVDILYMTR VQKERLDPSE YANVKAQFVL RASDLHNAKA NMKVLHPLPR VDEIATDVDK TPHAWYFQQA GNGIFARQAL LALVLNRDLV L
|
| |