Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3622 |
Symbol | carB |
ID | 6065897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3964185 |
End bp | 3967406 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641603040 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_001726563 |
Protein GI | 170021609 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.443028 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAC GTACAGATAT AAAAAGTATC CTGATTCTGG GTGCGGGCCC GATTGTTATC GGTCAGGCGT GTGAGTTTGA CTACTCTGGC GCGCAAGCGT GTAAAGCCCT GCGTGAAGAG GGTTACCGCG TCATTCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAAATG GCTGATGCAA CCTACATCGA GCCGATTCAC TGGGAAGTTG TACGCAAGAT TATTGAAAAA GAGCGCCCGG ACGCGGTGCT GCCAACGATG GGCGGTCAGA CGGCGCTGAA CTGCGCGCTG GAGCTGGAAC GTCAGGGCGT GTTGGAAGAG TTCGGTGTCA CCATGATTGG TGCCACTGCC GATGCGATTG ATAAAGCAGA AGACCGCCGT CGTTTCGACG TAGCGATGAA GAAAATTGGT CTGGAAACCG CGCGTTCCGG TATCGCACAC ACGATGGAAG AAGCGCTGGC GGTTGCCGCT GACGTGGGCT TCCCGTGCAT TATTCGCCCA TCCTTTACCA TGGGCGGTAG CGGCGGCGGT ATCGCTTATA ACCGTGAAGA GTTTGAAGAA ATTTGCGCCC GCGGTCTGGA TCTCTCTCCG ACCAAAGAGT TGCTGATTGA TGAGTCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG GTGCGTGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT CGATGCGATG GGCATCCACA CCGGTGACTC CATCACTGTC GCGCCAGCCC AAACGCTGAC CGACAAAGAA TATCAAATCA TGCGTAACGC CTCGATGGCG GTGCTGCGTG AAATCGGCGT TGAAACCGGT GGTTCCAACG TTCAGTTTGC GGTGAACCCG AAAAACGGTC GTCTGATTGT TATCGAAATG AACCCACGCG TGTCCCGTTC TTCGGCGCTG GCGTCGAAAG CGACCGGTTT CCCGATTGCT AAAGTGGCGG CGAAACTGGC GGTGGGTTAC ACCCTCGACG AACTGATGAA CGACATCACT GGCGGACGTA CTCCGGCCTC CTTCGAGCCG TCCATCGACT ATGTGGTTAC TAAAATTCCT CGCTTCAACT TCGAAAAATT CGCCGGTGCT AACGACCGTC TGACCACTCA GATGAAATCG GTTGGCGAAG TGATGGCGAT TGGTCGCACG CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC GGCCTGGAAG TCGGTGCGAC TGGATTCGAC CCGAAAGTGA GCCTGGATGA CCCGGAAGCG TTAACCAAAA TCCGTCGCGA ACTGAAAGAC GCAGGCGCAG ATCGTATCTG GTACATCGCC GATGCGTTCC GTGCGGGCCT GTCTGTGGAC GGCGTCTTCA ACCTGACCAA TATTGACCGC TGGTTCCTGG TACAGATTGA AGAACTGGTG CGTCTGGAAG AGAAAGTAGC GGAAGTGGGC ATCACTGGCC TGAACGCTGA ATTCCTGCGC CAGCTGAAAC GCAAAGGCTT TGCCGATGCG CGTCTGGCAA AACTGGCGGG CGTGCGCGAA GCGGAAATCC GCAAGCTGCG CGACCAGTAT GACCTGCACC CGGTTTATAA GCGCGTGGAT ACCTGTGCGG CAGAGTTCGC CACCGACACC GCTTACATGT ACTCCACTTA TGAAGAAGAG TGCGAAGCGA ATCCGTCTAC CGACCGTGAA AAAATCATGG TTCTCGGCGG CGGCCCGAAC CGTATCGGTC AGGGTATCGA ATTCGACTAC TGCTGTGTAC ACGCCTCGCT GGCGCTGCGC GAAGACGGTT ACGAAACCAT TATGGTTAAC TGTAACCCGG AAACCGTCTC CACCGACTAC GACACTTCCG ACCGCCTCTA CTTCGAGCCG GTAACCCTGG AAGATGTGCT GGAAATCGTG CGTATTGAGA AGCCGAAAGG CGTTATCGTT CAGTACGGCG GTCAGACCCC GCTGAAACTG GCGCGTGCAC TGGAAGCCGC TGGCGTACCG GTTATCGGCA CCAGCCCGGA TGCTATCGAC CGTGCGGAAG ACCGTGAACG CTTCCAGCAC GCGGTTGACC GCCTGAAACT GAAGCAACCG GCGAACGCCA CCGTTACCGC TATCGAAATG GCGGTAGAGA AGGCGAAAGA GATTGGCTAC CCGCTGGTGG TACGTCCGTC TTACGTTCTC GGCGGTCGGG CGATGGAAAT CGTCTATGAC GAAGCAGACC TGCGTCGCTA CTTCCAGACG GCGGTCAGCG TGTCTAACGA TGCGCCAGTG TTGCTGGACC ACTTCCTCGA TGACGCGGTA GAAGTGGATG TGGACGCCAT CTGCGACGGC GAAATGGTGC TGATTGGCGG CATCATGGAG CATATTGAGC AGGCGGGCGT GCACTCCGGT GACTCCGCAT GTTCGCTGCC AGCCTACACT TTAAGTCAGG AAATTCAGGA TGTGATGCGC CAGCAGGTGC AGAAACTGGC CTTCGAATTG CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTGAAAA ACAACGAAGT CTACCTGATT GAAGTTAACC CGCGTGCGGC GCGTACCGTT CCGTTCGTCT CCAAAGCCAC CGGCGTACCG CTGGCAAAAG TGGCGGCGCG CGTGATGGCT GGCAAATCGC TGGCTGAGCA GGGCGTAACC AAAGAAGTTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGCTGCCGTT CAATAAATTC CCGGGCGTTG ACCCGCTGTT AGGGCCAGAA ATGCGCTCTA CCGGGGAAGT CATGGGCGTG GGCCGCACCT TCGCTGAAGC GTTTGCCAAA GCGCAGCTGG GCAGCAACTC CACCATGAAG AAACACGGTC GTGCGCTGCT TTCCGTGCGC GAAGGCGATA AAGAACGCGT GGTGGACCTG GCGGCAAAAC TGCTGAAACA GGGCTTCGAG CTGGATGCGA CCCACGGCAC GGCGATTGTG CTGGGCGAAG CAGGTATCAA CCCGCGTCTG GTAAACAAGG TGCATGAAGG CCGTCCGCAC ATTCAGGACC GTATCAAGAA TGGCGAATAT ACCTACATCA TCAACACCAC CTCAGGCCGT CGTGCGATTG AAGACTCCCG CGTGATTCGT CGCAGTGCGC TGCAATATAA AGTGCATTAC GACACCACCC TGAACGGCGG CTTTGCCACC GCGATGGCGC TGAATGCCGA TGCGACTGAA AAAGTAATTT CGGTGCAGGA AATGCACGCA CAGATCAAAT AA
|
Protein sequence | MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA DAIDKAEDRR RFDVAMKKIG LETARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGSGGG IAYNREEFEE ICARGLDLSP TKELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA LTKIRRELKD AGADRIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG ITGLNAEFLR QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT AYMYSTYEEE CEANPSTDRE KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP VIGTSPDAID RAEDRERFQH AVDRLKLKQP ANATVTAIEM AVEKAKEIGY PLVVRPSYVL GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDHFLDDAV EVDVDAICDG EMVLIGGIME HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKNNEVYLI EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLAEQGVT KEVIPPYYSV KEVVLPFNKF PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KHGRALLSVR EGDKERVVDL AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTSGR RAIEDSRVIR RSALQYKVHY DTTLNGGFAT AMALNADATE KVISVQEMHA QIK
|
| |