Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0036 |
Symbol | carB |
ID | 6971051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 35192 |
End bp | 38413 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384117 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_002268640 |
Protein GI | 209396701 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAC GTACAGATAT AAAAAGTATC CTGATTCTGG GTGCGGGCCC GATTGTTATC GGTCAGGCGT GTGAGTTTGA CTACTCTGGC GCGCAAGCAT GTAAAGCCCT GCGCGAAGAG GGTTACCGCG TCATTCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAAATG GCCGATGCGA CCTACATCGA GCCGATTCAC TGGGAAGTGG TACGTAAGAT TATTGAAAAA GAGCGCCCGG ACGCGGTGCT GCCAACCATG GGCGGTCAGA CGGCGCTGAA CTGCGCGCTG GAGCTGGAAC GTCAGGGCGT GTTGGAAGAG TTCGGCGTCA CCATGATTGG TGCCACTGCC GATGCGATTG ATAAAGCAGA AGACCGCCGT CGTTTCGACG TAGCGATGAA GAAAATTGGT CTGGAAACCG CGCGTTCCGG TATCGCACAT ACGATGGAAG AAGCGCTGGC GGTTGCCGCT GACGTGGGCT TCCCGTGCAT TATTCGCCCA TCCTTTACCA TGGGCGGTAG CGGCGGCGGT ATCGCTTATA ACCGCGAAGA GTTTGAAGAA ATTTGCGCCC GCGGTCTGGA TCTCTCTCCG ACCAAAGAGT TGCTGATTGA TGAGTCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG GTGCGCGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT TGATGCGATG GGCATCCACA CCGGTGACTC CATCACTGTC GCGCCAGCCC AAACGCTGAC CGACAAAGAA TATCAAATCA TGCGTAACGC CTCGATGGCG GTGCTGCGTG AAATCGGCGT TGAAACCGGT GGTTCCAATG TCCAGTTTGC GGTGAACCCG AAAAACGGTC GCCTGATTGT TATCGAAATG AACCCACGCG TGTCCCGTTC TTCGGCGCTG GCGTCGAAAG CGACCGGTTT CCCGATTGCT AAAGTGGCGG CGAAACTGGC GGTGGGTTAC ACCCTCGATG AACTGATGAA CGACATCACT GGCGGACGTA CTCCGGCCTC CTTCGAGCCG TCCATCGATT ACGTGGTTAC CAAAATTCCT CGCTTCAACT TCGAAAAATT CGCCGGTGCT AACGACCGTC TGACCACTCA GATGAAATCG GTTGGCGAAG TGATGGCGAT TGGTCGCACG CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC GGCCTGGAAG TCGGTGCGAC TGGATTCGAC CCGAAAGTGA GCCTGGATGA CCCGGAAGCG TTAACCAAAA TCCGTCGCGA ACTGAAAGAC GCAGGCGCAG AGCGTATCTG GTACATCGCC GATGCTTTCC GCGCGGGCCT GTCTGTGGAC GGCGTCTTCA ACCTGACTAA CATTGACCGC TGGTTCCTGG TACAGATTGA AGAACTGGTG CGTCTGGAAG AGAAAGTGGC GGAAGTGGGC ATCACTGGCC TGAACGCTGA ATTCCTGCGC CAGCTGAAAC GCAAAGGCTT TGCCGATGCG CGCTTGGCAA AACTGGCGGG CGTACGCGAA GCGGAAATCC GTAAGCTGCG TGACCAATAT GACCTGCACC CGGTCTACAA GCGCGTGGAT ACCTGTGCGG CAGAGTTTGC CACCGACACC GCTTACATGT ACTCCACTTA TGAAGAAGAG TGCGAAGCGA ATCCGTCTAC CGACCGTGAA AAAATCATGG TGCTTGGCGG CGGTCCAAAC CGTATCGGTC AGGGTATCGA ATTCGACTAC TGCTGCGTAC ACGCCTCGCT GGCGCTGCGT GAAGACGGTT ACGAAACCAT TATGGTTAAC TGTAACCCGG AAACCGTCTC TACCGACTAC GACACTTCCG ATCGCCTCTA CTTCGAGCCG GTAACTCTGG AAGATGTGCT GGAAATCGTG CGTATTGAGA AGCCGAAAGG CGTTATCGTT CAGTACGGCG GTCAGACCCC GCTGAAACTG GCGCGTGCAC TGGAAGCCGC TGGCGTACCG GTTATCGGCA CCAGCCCGGA TGCTATCGAC CGTGCGGAAG ACCGTGAACG CTTCCAGCAT GCGGTTGACC GTCTGAAACT GAAACAACCG GCGAACGCCA CCGTTACTGC TATTGAAATG GCAGTTGAGA AGGCAAAAGA GATTGGCTAC CCGCTGGTGG TGCGTCCGTC TTACGTTCTC GGCGGTCGGG CGATGGAAAT CGTCTATGAC GAAGCTGACC TGCGTCGCTA CTTCCAGACG GCGGTTAGCG TGTCTAACGA TGCGCCAGTG TTGCTGGATC ATTTCCTTGA TGACGCAGTA GAAGTTGACG TGGATGCCAT CTGCGACGGC GAAATGGTGC TGATTGGCGG CATCATGGAG CACATCGAGC AGGCGGGCGT GCACTCCGGT GACTCCGCAT GTTCGCTGCC AGCGTACACC TTAAGTCAGG AAATTCAGGA TGTAATGCGC CAGCAGGTGC AGAAACTGGC CTTCGAATTG CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTGAAAA ACAACGAAGT CTACCTGATT GAAGTTAACC CGCGTGCGGC GCGTACCGTT CCGTTCGTCT CCAAAGCCAC CGGCGTACCG CTGGCAAAAG TGGCGGCGCG TGTGATGGCT GGCAAATCGC TGGCTGAGCA GGGCGTAACC AAAGAAGTTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGCTGCCGTT CAATAAATTC CCGGGCGTTG ACCCGCTGTT AGGGCCAGAA ATGCGTTCTA CCGGGGAAGT CATGGGCGTG GGCCGCACCT TCGCTGAAGC GTTTGCCAAA GCGCAGCTGG GCAGCAACTC CACCATGAAG AAACACGGTC GTGCGCTGCT TTCCGTGCGC GAAGGCGATA AAGAACGCGT GGTGGACCTG GCGGCAAAAC TGCTGAAACA GGGCTTCGAG CTGGATGCGA CCCACGGCAC GGCGATTGTG CTGGGCGAAG CGGGTATCAA TCCGCGTCTG GTAAACAAGG TGCATGAAGG CCGTCCGCAC ATTCAGGACC GTATCAAGAA TGGCGAATAT ACCTACATCA TCAACACCAC CTCAGGCCGT CGTGCGATTG AAGACTCCCG CGTGATCCGT CGCAGTGCGC TGCAATATAA AGTGCATTAT GACACCACCC TGAACGGTGG TTTCGCTACC GCGATGGCGC TGAATGCCGA TGCGACTGAA AAAGTAATTT CGGTGCAGGA AATGCACGCA CAGATCAAAT AA
|
Protein sequence | MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA DAIDKAEDRR RFDVAMKKIG LETARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGSGGG IAYNREEFEE ICARGLDLSP TKELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA LTKIRRELKD AGAERIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG ITGLNAEFLR QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT AYMYSTYEEE CEANPSTDRE KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP VIGTSPDAID RAEDRERFQH AVDRLKLKQP ANATVTAIEM AVEKAKEIGY PLVVRPSYVL GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDHFLDDAV EVDVDAICDG EMVLIGGIME HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKNNEVYLI EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLAEQGVT KEVIPPYYSV KEVVLPFNKF PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KHGRALLSVR EGDKERVVDL AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTSGR RAIEDSRVIR RSALQYKVHY DTTLNGGFAT AMALNADATE KVISVQEMHA QIK
|
| |