Gene B21_00036 details

Gene Information

Locus tag: B21_00036
Symbol: carB
ID: 8113087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 34889
End bp: 38110
Gene Length: 3222 bp
Protein Length: 1073 aa
Translation table: 11
GC content: 56%
IMG OID: 644846331
Product: hypothetical protein
Protein accession: YP_002997904
Protein GI: 251783600
COG category: [E] Amino acid transport and metabolism
              [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
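
The coordinates and lengths listed in this record agree with each other, which a little arithmetic confirms. The Python sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(34889, 38110)
print(length_bp)                  # 3222
print(protein_length(length_bp))  # 1073
```

Both results match the Gene Length and Protein Length fields of the record.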


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAAAC GTACAGATAT AAAAAGTATC CTGATTCTGG GTGCGGGCCC GATTGTTATC 
GGTCAGGCGT GTGAGTTTGA CTACTCTGGC GCGCAAGCAT GTAAAGCCCT GCGCGAAGAG
GGTTACCGCG TCATTCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAAATG
GCCGATGCGA CCTACATCGA GCCGATTCAC TGGGAAGTAG TACGCAAGAT TATTGAAAAA
GAGCGCCCGG ACGCGGTGCT GCCAACGATG GGCGGTCAGA CGGCGCTGAA CTGCGCGCTG
GAGCTGGAGC GTCAGGGCGT GTTGGAAGAG TTCGGCGTGA CTATGATTGG TGCGACCGCC
GATGCGATTG ATAAAGCAGA AGACCGCCGT CGTTTCGACG TAGCGATGAA GAAAATTGGT
CTGGAAACCG CGCGTTCCGG TATCGCACAC ACGATGGAAG AAGCGCTGGC GGTTGCCGCT
GACGTGGGCT TCCCGTGCAT TATTCGCCCA TCCTTTACCA TGGGCGGTAG CGGCGGCGGT
ATCGCTTATA ACCGCGAAGA GTTTGAAGAA ATTTGCGCCC GCGGTCTGGA TCTCTCCCCA
ACCAAAGAGC TGCTGATTGA TGAGTCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG
GTGCGTGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT TGATGCGATG
GGCATCCATA CCGGTGACTC CATCACTGTC GCGCCAGCCC AAACGCTGAC CGACAAAGAA
TATCAAATCA TGCGTAACGC CTCGATGGCG GTGCTGCGTG AAATCGGCGT TGAAACCGGT
GGTTCCAATG TCCAGTTTGC GGTGAACCCG AAAAACGGTC GCCTGATTGT TATCGAAATG
AACCCACGCG TGTCCCGTTC TTCGGCGCTG GCGTCGAAAG CGACCGGTTT CCCGATTGCT
AAAGTGGCGG CGAAACTGGC GGTGGGTTAC ACCCTCGACG AACTGATGAA CGACATCACT
GGCGGACGTA CTCCGGCCTC CTTCGAGCCG TCCATCGACT ATGTGGTTAC TAAAATTCCT
CGCTTCAACT TCGAAAAATT CGCCGGTGCT AACGACCGTC TGACCACTCA GATGAAATCG
GTTGGCGAAG TGATGGCGAT TGGTCGCACG CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC
GGCCTGGAAG TCGGTGCGAC TGGATTCGAC CCGAAAGTGA GCCTCGATGA CCCGGAAGCG
TTAACCAAAA TCCGTCGCGA ACTGAAAGAT GCTGGCGCAG AGCGTATCTG GTACATCGCC
GATGCCTTCC GTGCGGGCCT GTCTGTGGAC GGCGTGTTCA ACCTGACCAA TATTGACCGC
TGGTTCCTGG TACAGATTGA AGAACTGGTG CGTCTGGAAG AGAAAGTAGC GGAAGTGGGC
ATCACTGGCC TGAACGCTGA ATTCCTGCGC CAGCTGAAAC GCAAAGGCTT TGCCGATGCG
CGTCTGGCAA AACTGGCGGG CGTGCGCGAA GCGGAAATCC GCAAGCTGCG CGACCAGTAT
GACCTGCACC CGGTTTATAA GCGCGTGGAT ACCTGTGCGG CAGAGTTCGC CACCGACACC
GCTTACATGT ACTCCACTTA TGAAGAAGAG TGCGAAGCGA ATCCGTCTAC CGACCGTGAA
AAAATCATGG TTCTCGGCGG CGGCCCGAAC CGTATCGGTC AGGGTATCGA ATTCGACTAC
TGCTGTGTAC ACGCCTCGCT GGCGCTGCGC GAAGACGGTT ACGAAACCAT TATGGTTAAC
TGTAACCCGG AAACCGTCTC CACCGACTAC GACACTTCCG ATCGCCTCTA CTTCGAGCCG
GTAACTCTGG AAGATGTGCT GGAAATCGTG CGTATTGAGA AGCCGAAAGG CGTTATCGTT
CAGTACGGTG GTCAGACCCC GCTGAAACTG GCGCGCGCGC TGGAAGCTGC TGGCGTACCG
GTTATCGGCA CCAGCCCGGA TGCTATCGAC CGTGCAGAAG ACCGTGAACG CTTCCAGCAT
GCGGTTGAGC GTCTGAAACT GAAACAACCG GCGAACGCCA CCGTTACCGC TATTGAAATG
GCGGTAGAGA AGGCGAAAGA GATTGGCTAC CCGCTGGTGG TACGTCCGTC TTACGTTCTC
GGCGGTCGGG CGATGGAAAT CGTCTATGAC GAAGCTGACC TGCGTCGCTA CTTCCAGACG
GCGGTCAGCG TGTCTAACGA TGCGCCAGTG TTGCTGGATC ACTTCCTTGA TGACGCGGTA
GAAGTTGACG TGGATGCCAT CTGCGACGGC GAAATGGTGC TGATTGGCGG CATCATGGAG
CACATTGAGC AGGCGGGCGT GCACTCCGGT GACTCCGCAT GTTCGCTACC AGCCTACACC
TTAAGTCAGG AAATTCAGGA TGTGATGCGC CAGCAGGTGC AGAAACTGGC CTTCGAATTG
CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTTAAAA ACAACGAAGT CTACCTGATT
GAAGTTAACC CGCGTGCGGC GCGTACCGTT CCGTTCGTCT CCAAAGCCAC CGGCGTTCCG
CTGGCAAAAG TGGCGGCGCG TGTGATGGCT GGCAAATCGC TGGCTGAGCA GGGCGTAACC
AAAGAAGTTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGCTGCCGTT CAATAAATTC
CCGGGTGTTG ACCCGCTGTT AGGGCCAGAA ATGCGCTCTA CCGGGGAAGT CATGGGCGTG
GGCCGCACCT TCGCTGAAGC GTTCGCCAAA GCGCAGCTGG GCAGCAACTC CACCATGAAG
AAACACGGTC GTGCGCTGCT TTCCGTGCGC GAAGGCGATA AAGAACGCGT GGTGGACCTG
GCGGCAAAAC TGCTGAAACA GGGCTTCGAG CTGGATGCGA CGCACGGCAC GGCGATTGTG
CTGGGTGAAG CGGGTATCAA TCCGCGTCTG GTGAACAAGG TGCATGAAGG CCGTCCGCAC
ATTCAGGACC GTATTAAGAA TGGCGAATAT ACCTACATCA TCAACACCAC CTCAGGTCGT
CGTGCGATTG AAGACTCCCG CGTGATTCGT CGCAGTGCGC TGCAATATAA AGTGCATTAC
GACACCACCC TGAACGGCGG CTTTGCCACC GCGATGGCGC TGAATGCCGA TGCGACTGAA
AAAGTAATTT CGGTGCAGGA AATGCACGCA CAGATCAAAT AA
 
Protein sequence
MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM 
ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA
DAIDKAEDRR RFDVAMKKIG LETARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGSGGG
IAYNREEFEE ICARGLDLSP TKELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM
GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP
RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA
LTKIRRELKD AGAERIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG
ITGLNAEFLR QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT
AYMYSTYEEE CEANPSTDRE KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN
CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP
VIGTSPDAID RAEDRERFQH AVERLKLKQP ANATVTAIEM AVEKAKEIGY PLVVRPSYVL
GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDHFLDDAV EVDVDAICDG EMVLIGGIME
HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKNNEVYLI
EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLAEQGVT KEVIPPYYSV KEVVLPFNKF
PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KHGRALLSVR EGDKERVVDL
AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTSGR
RAIEDSRVIR RSALQYKVHY DTTLNGGFAT AMALNADATE KVISVQEMHA QIK
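
The sequences in this record are displayed in space-separated blocks of ten, so they need to be joined before any analysis. The sketch below is one possible way to do that in Python and to compute the length and GC percentage of the result. The helper names are hypothetical, and the sample input is only the first line of the gene sequence, not the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCAAAAC GTACAGATAT AAAAAGTATC CTGATTCTGG GTGCGGGCCC GATTGTTATC"
dna = join_blocks(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Running the same two functions on the full gene sequence gives values that can be compared with the Gene Length and GC content fields of the record.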