Gene ECH74115_0036 details


Gene Information

Locus tag: ECH74115_0036
Symbol: carB
ID: 6971051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 35192
End bp: 38413
Gene Length: 3222 bp
Protein Length: 1073 aa
Translation table: 11
GC content: 56%
IMG OID: 643384117
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_002268640
Protein GI: 209396701
COG category: [E] Amino acid transport and metabolism
    [F] Nucleotide transport and metabolism
    [I] Lipid transport and metabolism
COG ID: [COG0439] Biotin carboxylase
    [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
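
The start and end coordinates, gene length, and protein length in this record agree with one another, and the check is simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 35192
end_bp = 38413

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3222
print(protein_length)  # 1073
```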


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAAAC GTACAGATAT AAAAAGTATC CTGATTCTGG GTGCGGGCCC GATTGTTATC 
GGTCAGGCGT GTGAGTTTGA CTACTCTGGC GCGCAAGCAT GTAAAGCCCT GCGCGAAGAG
GGTTACCGCG TCATTCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAAATG
GCCGATGCGA CCTACATCGA GCCGATTCAC TGGGAAGTGG TACGTAAGAT TATTGAAAAA
GAGCGCCCGG ACGCGGTGCT GCCAACCATG GGCGGTCAGA CGGCGCTGAA CTGCGCGCTG
GAGCTGGAAC GTCAGGGCGT GTTGGAAGAG TTCGGCGTCA CCATGATTGG TGCCACTGCC
GATGCGATTG ATAAAGCAGA AGACCGCCGT CGTTTCGACG TAGCGATGAA GAAAATTGGT
CTGGAAACCG CGCGTTCCGG TATCGCACAT ACGATGGAAG AAGCGCTGGC GGTTGCCGCT
GACGTGGGCT TCCCGTGCAT TATTCGCCCA TCCTTTACCA TGGGCGGTAG CGGCGGCGGT
ATCGCTTATA ACCGCGAAGA GTTTGAAGAA ATTTGCGCCC GCGGTCTGGA TCTCTCTCCG
ACCAAAGAGT TGCTGATTGA TGAGTCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG
GTGCGCGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT TGATGCGATG
GGCATCCACA CCGGTGACTC CATCACTGTC GCGCCAGCCC AAACGCTGAC CGACAAAGAA
TATCAAATCA TGCGTAACGC CTCGATGGCG GTGCTGCGTG AAATCGGCGT TGAAACCGGT
GGTTCCAATG TCCAGTTTGC GGTGAACCCG AAAAACGGTC GCCTGATTGT TATCGAAATG
AACCCACGCG TGTCCCGTTC TTCGGCGCTG GCGTCGAAAG CGACCGGTTT CCCGATTGCT
AAAGTGGCGG CGAAACTGGC GGTGGGTTAC ACCCTCGATG AACTGATGAA CGACATCACT
GGCGGACGTA CTCCGGCCTC CTTCGAGCCG TCCATCGATT ACGTGGTTAC CAAAATTCCT
CGCTTCAACT TCGAAAAATT CGCCGGTGCT AACGACCGTC TGACCACTCA GATGAAATCG
GTTGGCGAAG TGATGGCGAT TGGTCGCACG CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC
GGCCTGGAAG TCGGTGCGAC TGGATTCGAC CCGAAAGTGA GCCTGGATGA CCCGGAAGCG
TTAACCAAAA TCCGTCGCGA ACTGAAAGAC GCAGGCGCAG AGCGTATCTG GTACATCGCC
GATGCTTTCC GCGCGGGCCT GTCTGTGGAC GGCGTCTTCA ACCTGACTAA CATTGACCGC
TGGTTCCTGG TACAGATTGA AGAACTGGTG CGTCTGGAAG AGAAAGTGGC GGAAGTGGGC
ATCACTGGCC TGAACGCTGA ATTCCTGCGC CAGCTGAAAC GCAAAGGCTT TGCCGATGCG
CGCTTGGCAA AACTGGCGGG CGTACGCGAA GCGGAAATCC GTAAGCTGCG TGACCAATAT
GACCTGCACC CGGTCTACAA GCGCGTGGAT ACCTGTGCGG CAGAGTTTGC CACCGACACC
GCTTACATGT ACTCCACTTA TGAAGAAGAG TGCGAAGCGA ATCCGTCTAC CGACCGTGAA
AAAATCATGG TGCTTGGCGG CGGTCCAAAC CGTATCGGTC AGGGTATCGA ATTCGACTAC
TGCTGCGTAC ACGCCTCGCT GGCGCTGCGT GAAGACGGTT ACGAAACCAT TATGGTTAAC
TGTAACCCGG AAACCGTCTC TACCGACTAC GACACTTCCG ATCGCCTCTA CTTCGAGCCG
GTAACTCTGG AAGATGTGCT GGAAATCGTG CGTATTGAGA AGCCGAAAGG CGTTATCGTT
CAGTACGGCG GTCAGACCCC GCTGAAACTG GCGCGTGCAC TGGAAGCCGC TGGCGTACCG
GTTATCGGCA CCAGCCCGGA TGCTATCGAC CGTGCGGAAG ACCGTGAACG CTTCCAGCAT
GCGGTTGACC GTCTGAAACT GAAACAACCG GCGAACGCCA CCGTTACTGC TATTGAAATG
GCAGTTGAGA AGGCAAAAGA GATTGGCTAC CCGCTGGTGG TGCGTCCGTC TTACGTTCTC
GGCGGTCGGG CGATGGAAAT CGTCTATGAC GAAGCTGACC TGCGTCGCTA CTTCCAGACG
GCGGTTAGCG TGTCTAACGA TGCGCCAGTG TTGCTGGATC ATTTCCTTGA TGACGCAGTA
GAAGTTGACG TGGATGCCAT CTGCGACGGC GAAATGGTGC TGATTGGCGG CATCATGGAG
CACATCGAGC AGGCGGGCGT GCACTCCGGT GACTCCGCAT GTTCGCTGCC AGCGTACACC
TTAAGTCAGG AAATTCAGGA TGTAATGCGC CAGCAGGTGC AGAAACTGGC CTTCGAATTG
CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTGAAAA ACAACGAAGT CTACCTGATT
GAAGTTAACC CGCGTGCGGC GCGTACCGTT CCGTTCGTCT CCAAAGCCAC CGGCGTACCG
CTGGCAAAAG TGGCGGCGCG TGTGATGGCT GGCAAATCGC TGGCTGAGCA GGGCGTAACC
AAAGAAGTTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGCTGCCGTT CAATAAATTC
CCGGGCGTTG ACCCGCTGTT AGGGCCAGAA ATGCGTTCTA CCGGGGAAGT CATGGGCGTG
GGCCGCACCT TCGCTGAAGC GTTTGCCAAA GCGCAGCTGG GCAGCAACTC CACCATGAAG
AAACACGGTC GTGCGCTGCT TTCCGTGCGC GAAGGCGATA AAGAACGCGT GGTGGACCTG
GCGGCAAAAC TGCTGAAACA GGGCTTCGAG CTGGATGCGA CCCACGGCAC GGCGATTGTG
CTGGGCGAAG CGGGTATCAA TCCGCGTCTG GTAAACAAGG TGCATGAAGG CCGTCCGCAC
ATTCAGGACC GTATCAAGAA TGGCGAATAT ACCTACATCA TCAACACCAC CTCAGGCCGT
CGTGCGATTG AAGACTCCCG CGTGATCCGT CGCAGTGCGC TGCAATATAA AGTGCATTAT
GACACCACCC TGAACGGTGG TTTCGCTACC GCGATGGCGC TGAATGCCGA TGCGACTGAA
AAAGTAATTT CGGTGCAGGA AATGCACGCA CAGATCAAAT AA
 
Protein sequence
MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM 
ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA
DAIDKAEDRR RFDVAMKKIG LETARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGSGGG
IAYNREEFEE ICARGLDLSP TKELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM
GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP
RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA
LTKIRRELKD AGAERIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG
ITGLNAEFLR QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT
AYMYSTYEEE CEANPSTDRE KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN
CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP
VIGTSPDAID RAEDRERFQH AVDRLKLKQP ANATVTAIEM AVEKAKEIGY PLVVRPSYVL
GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDHFLDDAV EVDVDAICDG EMVLIGGIME
HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKNNEVYLI
EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLAEQGVT KEVIPPYYSV KEVVLPFNKF
PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KHGRALLSVR EGDKERVVDL
AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTSGR
RAIEDSRVIR RSALQYKVHY DTTLNGGFAT AMALNADATE KVISVQEMHA QIK
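
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs only in which codons may start translation; this gene starts with ATG). The helper name `translate` and the two test fragments, taken from the first and last lines of the gene sequence, are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used for 10-base grouping
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

first_30 = "ATGCCAAAAC GTACAGATAT AAAAAGTATC"  # start of the gene sequence
last_12 = "CAGATCAAAT AA"                      # end of the gene sequence

print(translate(first_30))  # MPKRTDIKSI
print(translate(last_12))   # QIK
```

Both fragments match the corresponding ends of the protein sequence listed above: it begins with MPKRTDIKSI and ends with QIK, and the final TAA codon is the stop.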