Gene Information |
Locus tag | ECD_00037 |
Symbol | carB |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 34889 |
End bp | 38110 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | carbamoyl-phosphate synthase large subunit |
Protein accession | ACT41939 |
Protein GI | 253976269 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
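The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no amino acid. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 34889
end_bp = 38110
reported_gene_length = 3222      # bp
reported_protein_length = 1073   # aa

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases. Assumption: the last codon is a
# stop codon and does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 3222 0 1073
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.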
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAC GTACAGATAT AAAAAGTATC CTGATTCTGG GTGCGGGCCC GATTGTTATC GGTCAGGCGT GTGAGTTTGA CTACTCTGGC GCGCAAGCAT GTAAAGCCCT GCGCGAAGAG GGTTACCGCG TCATTCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAAATG GCCGATGCGA CCTACATCGA GCCGATTCAC TGGGAAGTAG TACGCAAGAT TATTGAAAAA GAGCGCCCGG ACGCGGTGCT GCCAACGATG GGCGGTCAGA CGGCGCTGAA CTGCGCGCTG GAGCTGGAGC GTCAGGGCGT GTTGGAAGAG TTCGGCGTGA CTATGATTGG TGCGACCGCC GATGCGATTG ATAAAGCAGA AGACCGCCGT CGTTTCGACG TAGCGATGAA GAAAATTGGT CTGGAAACCG CGCGTTCCGG TATCGCACAC ACGATGGAAG AAGCGCTGGC GGTTGCCGCT GACGTGGGCT TCCCGTGCAT TATTCGCCCA TCCTTTACCA TGGGCGGTAG CGGCGGCGGT ATCGCTTATA ACCGCGAAGA GTTTGAAGAA ATTTGCGCCC GCGGTCTGGA TCTCTCCCCA ACCAAAGAGC TGCTGATTGA TGAGTCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG GTGCGTGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT TGATGCGATG GGCATCCATA CCGGTGACTC CATCACTGTC GCGCCAGCCC AAACGCTGAC CGACAAAGAA TATCAAATCA TGCGTAACGC CTCGATGGCG GTGCTGCGTG AAATCGGCGT TGAAACCGGT GGTTCCAATG TCCAGTTTGC GGTGAACCCG AAAAACGGTC GCCTGATTGT TATCGAAATG AACCCACGCG TGTCCCGTTC TTCGGCGCTG GCGTCGAAAG CGACCGGTTT CCCGATTGCT AAAGTGGCGG CGAAACTGGC GGTGGGTTAC ACCCTCGACG AACTGATGAA CGACATCACT GGCGGACGTA CTCCGGCCTC CTTCGAGCCG TCCATCGACT ATGTGGTTAC TAAAATTCCT CGCTTCAACT TCGAAAAATT CGCCGGTGCT AACGACCGTC TGACCACTCA GATGAAATCG GTTGGCGAAG TGATGGCGAT TGGTCGCACG CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC GGCCTGGAAG TCGGTGCGAC TGGATTCGAC CCGAAAGTGA GCCTCGATGA CCCGGAAGCG TTAACCAAAA TCCGTCGCGA ACTGAAAGAT GCTGGCGCAG AGCGTATCTG GTACATCGCC GATGCCTTCC GTGCGGGCCT GTCTGTGGAC GGCGTGTTCA ACCTGACCAA TATTGACCGC TGGTTCCTGG TACAGATTGA AGAACTGGTG CGTCTGGAAG AGAAAGTAGC GGAAGTGGGC ATCACTGGCC TGAACGCTGA ATTCCTGCGC CAGCTGAAAC GCAAAGGCTT TGCCGATGCG CGTCTGGCAA AACTGGCGGG CGTGCGCGAA GCGGAAATCC GCAAGCTGCG CGACCAGTAT GACCTGCACC CGGTTTATAA GCGCGTGGAT ACCTGTGCGG CAGAGTTCGC CACCGACACC GCTTACATGT ACTCCACTTA TGAAGAAGAG TGCGAAGCGA ATCCGTCTAC CGACCGTGAA AAAATCATGG TTCTCGGCGG CGGCCCGAAC CGTATCGGTC AGGGTATCGA ATTCGACTAC TGCTGTGTAC ACGCCTCGCT GGCGCTGCGC GAAGACGGTT ACGAAACCAT TATGGTTAAC 
TGTAACCCGG AAACCGTCTC CACCGACTAC GACACTTCCG ATCGCCTCTA CTTCGAGCCG GTAACTCTGG AAGATGTGCT GGAAATCGTG CGTATTGAGA AGCCGAAAGG CGTTATCGTT CAGTACGGTG GTCAGACCCC GCTGAAACTG GCGCGCGCGC TGGAAGCTGC TGGCGTACCG GTTATCGGCA CCAGCCCGGA TGCTATCGAC CGTGCAGAAG ACCGTGAACG CTTCCAGCAT GCGGTTGAGC GTCTGAAACT GAAACAACCG GCGAACGCCA CCGTTACCGC TATTGAAATG GCGGTAGAGA AGGCGAAAGA GATTGGCTAC CCGCTGGTGG TACGTCCGTC TTACGTTCTC GGCGGTCGGG CGATGGAAAT CGTCTATGAC GAAGCTGACC TGCGTCGCTA CTTCCAGACG GCGGTCAGCG TGTCTAACGA TGCGCCAGTG TTGCTGGATC ACTTCCTTGA TGACGCGGTA GAAGTTGACG TGGATGCCAT CTGCGACGGC GAAATGGTGC TGATTGGCGG CATCATGGAG CACATTGAGC AGGCGGGCGT GCACTCCGGT GACTCCGCAT GTTCGCTACC AGCCTACACC TTAAGTCAGG AAATTCAGGA TGTGATGCGC CAGCAGGTGC AGAAACTGGC CTTCGAATTG CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTTAAAA ACAACGAAGT CTACCTGATT GAAGTTAACC CGCGTGCGGC GCGTACCGTT CCGTTCGTCT CCAAAGCCAC CGGCGTTCCG CTGGCAAAAG TGGCGGCGCG TGTGATGGCT GGCAAATCGC TGGCTGAGCA GGGCGTAACC AAAGAAGTTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGCTGCCGTT CAATAAATTC CCGGGTGTTG ACCCGCTGTT AGGGCCAGAA ATGCGCTCTA CCGGGGAAGT CATGGGCGTG GGCCGCACCT TCGCTGAAGC GTTCGCCAAA GCGCAGCTGG GCAGCAACTC CACCATGAAG AAACACGGTC GTGCGCTGCT TTCCGTGCGC GAAGGCGATA AAGAACGCGT GGTGGACCTG GCGGCAAAAC TGCTGAAACA GGGCTTCGAG CTGGATGCGA CGCACGGCAC GGCGATTGTG CTGGGTGAAG CGGGTATCAA TCCGCGTCTG GTGAACAAGG TGCATGAAGG CCGTCCGCAC ATTCAGGACC GTATTAAGAA TGGCGAATAT ACCTACATCA TCAACACCAC CTCAGGTCGT CGTGCGATTG AAGACTCCCG CGTGATTCGT CGCAGTGCGC TGCAATATAA AGTGCATTAC GACACCACCC TGAACGGCGG CTTTGCCACC GCGATGGCGC TGAATGCCGA TGCGACTGAA AAAGTAATTT CGGTGCAGGA AATGCACGCA CAGATCAAAT AA
|
Protein sequence | MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA DAIDKAEDRR RFDVAMKKIG LETARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGSGGG IAYNREEFEE ICARGLDLSP TKELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA LTKIRRELKD AGAERIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG ITGLNAEFLR QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT AYMYSTYEEE CEANPSTDRE KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP VIGTSPDAID RAEDRERFQH AVERLKLKQP ANATVTAIEM AVEKAKEIGY PLVVRPSYVL GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDHFLDDAV EVDVDAICDG EMVLIGGIME HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKNNEVYLI EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLAEQGVT KEVIPPYYSV KEVVLPFNKF PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KHGRALLSVR EGDKERVVDL AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTSGR RAIEDSRVIR RSALQYKVHY DTTLNGGFAT AMALNADATE KVISVQEMHA QIK
|
| |
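The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the database's own method: it builds the codon table from the amino-acid string that NCBI lists for translation table 11 (its codon assignments are the same as the standard code) and translates only the first 30 and last 12 bases of the gene. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Base order and amino-acid string as listed by NCBI for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# product() varies the last base fastest, which matches the NCBI ordering.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases, copied from the Gene sequence field.
first_30 = "ATGCCAAAAC GTACAGATAT AAAAAGTATC"
last_12 = "CAGATCAAAT AA"

print(translate(first_30))  # MPKRTDIKSI
print(translate(last_12))   # QIK
```

Both fragments match the start (MPKRTDIKSI) and the end (QIK) of the Protein sequence field; the final TAA codon is the stop. Translating the full gene this way would require pasting the complete sequence into the script.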