Gene EcHS_A0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0036 
SymbolcarB 
ID5593831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp34076 
End bp37297 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content56% 
IMG OID640919224 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001456819 
Protein GI157159501 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase
[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAC GTACAGATAT AAAAAGTATC CTGATTCTGG GTGCGGGCCC GATTGTTATC 
GGTCAGGCGT GTGAGTTTGA CTACTCTGGC GCGCAAGCAT GTAAAGCCCT GCGCGAAGAG
GGTTACCGCG TCATTCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAAATG
GCCGATGCGA CCTACATCGA GCCGATTCAC TGGGAAGTAG TACGCAAGAT TATTGAAAAA
GAGCGCCCGG ACGCGGTGCT GCCAACGATG GGCGGTCAGA CGGCGCTGAA CTGCGCGCTG
GAGCTGGAGC GTCAGGGCGT GTTGGAAGAG TTCGGCGTGA CTATGATTGG TGCGACCGCC
GATGCGATTG ATAAAGCAGA AGACCGCCGT CGTTTCGACG TAGCGATGAA GAAAATTGGT
CTGGAAACCG CGCGTTCCGG TATCGCACAC ACGATGGAAG AAGCGCTGGC GGTTGCCGCT
GACGTGGGCT TCCCGTGCAT TATTCGCCCA TCCTTTACCA TGGGCGGTAG CGGCGGCGGT
ATCGCTTATA ACCGCGAAGA GTTTGAAGAA ATTTGCGCCC GCGGTCTGGA TCTCTCCCCA
ACCAAAGAGC TGCTGATTGA TGAGTCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG
GTGCGTGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT TGATGCGATG
GGCATCCATA CCGGTGACTC CATCACTGTC GCGCCAGCCC AAACGCTGAC CGACAAAGAA
TATCAAATCA TGCGTAACGC CTCGATGGCG GTGCTGCGTG AAATCGGCGT TGAAACCGGT
GGTTCCAATG TCCAGTTTGC GGTGAACCCG AAAAACGGTC GCCTGATTGT TATCGAAATG
AACCCACGCG TGTCCCGTTC TTCGGCGCTG GCGTCGAAAG CGACCGGTTT CCCGATTGCT
AAAGTGGCGG CGAAACTGGC GGTGGGTTAC ACCCTCGACG AACTGATGAA CGACATCACT
GGCGGACGTA CTCCGGCCTC CTTCGAGCCG TCCATCGACT ATGTGGTTAC TAAAATTCCT
CGCTTCAACT TCGAAAAATT CGCTGGTGCT AACGACCGTC TGACCACTCA GATGAAATCG
GTTGGCGAAG TGATGGCGAT TGGTCGCACG CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC
GGCCTGGAAG TCGGTGCGAC TGGATTCGAC CCGAAAGTGA GCCTCGATGA CCCGGAAGCG
TTAACCAAAA TCCGTCGCGA ACTGAAAGAT GCTGGCGCAG AGCGTATCTG GTACATCGCC
GATGCCTTCC GTGCGGGCCT GTCTGTGGAC GGCGTGTTCA ACCTGACCAA TATTGACCGC
TGGTTCCTGG TACAGATTGA AGAACTGGTG CGTCTGGAAG AGAAAGTAGC GGAAGTGGGC
ATCACTGGCC TGAACGCTGA ATTCCTGCGC CAGCTGAAAC GCAAAGGCTT TGCCGATGCG
CGTCTGGCAA AACTGGCGGG CGTGCGCGAA GCGGAAATCC GCAAGCTGCG CGACCAGTAT
GACCTGCACC CGGTTTATAA GCGCGTGGAT ACCTGTGCGG CAGAGTTCGC CACCGACACC
GCTTACATGT ACTCCACTTA TGAAGAAGAG TGCGAAGCGA ATCCGTCTAC CGACCGTGAA
AAAATCATGG TTCTCGGCGG CGGCCCGAAC CGTATCGGTC AGGGTATCGA ATTCGACTAC
TGCTGTGTAC ACGCCTCGCT GGCGCTGCGC GAAGACGGTT ACGAAACCAT TATGGTTAAC
TGTAACCCGG AAACCGTCTC CACCGACTAC GACACTTCCG ATCGCCTCTA CTTCGAGCCG
GTAACTCTGG AAGATGTGCT AGAAATCGTG CGTATTGAGA AGCCGAAAGG CGTTATCGTT
CAGTACGGCG GTCAGACCCC GCTGAAACTG GCGCGCGCGC TGGAAGCTGC TGGCGTACCG
GTTATCGGCA CCAGCCCGGA TGCTATCGAC CGTGCAGAAG ACCGTGAACG CTTCCAGCAT
GCGGTTGAGC GTCTGAAACT GAAACAACCG GCGAACGCCA CCGTTACCGC TATTGAAATG
GCGGTAGAGA AGGCGAAAGA GATTGGCTAC CCGCTGGTGG TACGTCCGTC TTACGTTCTC
GGCGGTCGGG CGATGGAAAT CGTCTATGAC GAAGCTGACC TGCGTCGCTA CTTCCAGACG
GCGGTCAGCG TGTCTAACGA TGCGCCAGTG TTGCTGGATC ACTTCCTTGA TGACGCGGTA
GAAGTTGACG TGGATGCCAT CTGCGACGGC GAAATGGTGC TGATTGGCGG CATCATGGAG
CACATCGAGC AGGCGGGCGT GCACTCCGGT GACTCCGCAT GTTCTCTGCC AGCCTACACC
TTAAGTCAGG AAATTCAGGA TGTGATGCGC CAGCAGGTGC AGAAACTGGC CTTCGAATTG
CAGGTGCGCG GTCTGATGAA CGTGCAGTTT GCGGTGAAAA ACAACGAAGT CTACCTGATT
GAAGTTAACC CGCGTGCGGC GCGTACCGTT CCGTTCGTCT CCAAAGCCAC CGGCGTACCG
CTGGCAAAAG TGGCGGCGCG TGTGATGGCT GGCAAATCGC TGGCTGAGCA GGGCGTAACC
AAAGAAGTTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGCTGCCGTT CAATAAATTC
CCGGGTGTTG ACCCGCTGTT AGGGCCAGAA ATGCGCTCCA CCGGGGAAGT GATGGGCGTG
GGCCGCACCT TCGCTGAAGC GTTTGCCAAA GCGCAGCTGG GCAGCAACTC CACCATGAAG
AAACACGGTC GTGCGCTGCT TTCCGTGCGC GAAGGCGATA AAGAACGCGT GGTGGACCTG
GCGGCAAAAC TGCTGAAACA GGGCTTCGAG CTGGATGCGA CCCACGGCAC GGCGATTGTG
CTGGGCGAAG CGGGCATTAA TCCGCGTCTG GTGAACAAGG TGCATGAAGG CCGTCCGCAC
ATTCAGGACC GCATCAAGAA TGGCGAATAT ACCTACATCA TCAACACCAC TTCAGGCCGT
CGCGCGATTG AAGACTCCCG CGTGATCCGT CGCAGTGCGC TGCAATATAA AGTGCATTAT
GACACCACCC TGAACGGTGG TTTCGCTACC GCGATGGCGC TGAATGCCGA CGCGACCGAA
AAAGTAATTT CGGTGCAGGA AATGCACGCG CAGATCAAAT AA
 
Protein sequence
MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM 
ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA
DAIDKAEDRR RFDVAMKKIG LETARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGSGGG
IAYNREEFEE ICARGLDLSP TKELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM
GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP
RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA
LTKIRRELKD AGAERIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG
ITGLNAEFLR QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT
AYMYSTYEEE CEANPSTDRE KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN
CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP
VIGTSPDAID RAEDRERFQH AVERLKLKQP ANATVTAIEM AVEKAKEIGY PLVVRPSYVL
GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDHFLDDAV EVDVDAICDG EMVLIGGIME
HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKNNEVYLI
EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLAEQGVT KEVIPPYYSV KEVVLPFNKF
PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KHGRALLSVR EGDKERVVDL
AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTSGR
RAIEDSRVIR RSALQYKVHY DTTLNGGFAT AMALNADATE KVISVQEMHA QIK