Gene EcolC_3622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3622 
SymbolcarB 
ID6065897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3964185 
End bp3967406 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content56% 
IMG OID641603040 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001726563 
Protein GI170021609 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase
[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.443028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAC GTACAGATAT AAAAAGTATC CTGATTCTGG GTGCGGGCCC GATTGTTATC 
GGTCAGGCGT GTGAGTTTGA CTACTCTGGC GCGCAAGCGT GTAAAGCCCT GCGTGAAGAG
GGTTACCGCG TCATTCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAAATG
GCTGATGCAA CCTACATCGA GCCGATTCAC TGGGAAGTTG TACGCAAGAT TATTGAAAAA
GAGCGCCCGG ACGCGGTGCT GCCAACGATG GGCGGTCAGA CGGCGCTGAA CTGCGCGCTG
GAGCTGGAAC GTCAGGGCGT GTTGGAAGAG TTCGGTGTCA CCATGATTGG TGCCACTGCC
GATGCGATTG ATAAAGCAGA AGACCGCCGT CGTTTCGACG TAGCGATGAA GAAAATTGGT
CTGGAAACCG CGCGTTCCGG TATCGCACAC ACGATGGAAG AAGCGCTGGC GGTTGCCGCT
GACGTGGGCT TCCCGTGCAT TATTCGCCCA TCCTTTACCA TGGGCGGTAG CGGCGGCGGT
ATCGCTTATA ACCGTGAAGA GTTTGAAGAA ATTTGCGCCC GCGGTCTGGA TCTCTCTCCG
ACCAAAGAGT TGCTGATTGA TGAGTCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG
GTGCGTGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT CGATGCGATG
GGCATCCACA CCGGTGACTC CATCACTGTC GCGCCAGCCC AAACGCTGAC CGACAAAGAA
TATCAAATCA TGCGTAACGC CTCGATGGCG GTGCTGCGTG AAATCGGCGT TGAAACCGGT
GGTTCCAACG TTCAGTTTGC GGTGAACCCG AAAAACGGTC GTCTGATTGT TATCGAAATG
AACCCACGCG TGTCCCGTTC TTCGGCGCTG GCGTCGAAAG CGACCGGTTT CCCGATTGCT
AAAGTGGCGG CGAAACTGGC GGTGGGTTAC ACCCTCGACG AACTGATGAA CGACATCACT
GGCGGACGTA CTCCGGCCTC CTTCGAGCCG TCCATCGACT ATGTGGTTAC TAAAATTCCT
CGCTTCAACT TCGAAAAATT CGCCGGTGCT AACGACCGTC TGACCACTCA GATGAAATCG
GTTGGCGAAG TGATGGCGAT TGGTCGCACG CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC
GGCCTGGAAG TCGGTGCGAC TGGATTCGAC CCGAAAGTGA GCCTGGATGA CCCGGAAGCG
TTAACCAAAA TCCGTCGCGA ACTGAAAGAC GCAGGCGCAG ATCGTATCTG GTACATCGCC
GATGCGTTCC GTGCGGGCCT GTCTGTGGAC GGCGTCTTCA ACCTGACCAA TATTGACCGC
TGGTTCCTGG TACAGATTGA AGAACTGGTG CGTCTGGAAG AGAAAGTAGC GGAAGTGGGC
ATCACTGGCC TGAACGCTGA ATTCCTGCGC CAGCTGAAAC GCAAAGGCTT TGCCGATGCG
CGTCTGGCAA AACTGGCGGG CGTGCGCGAA GCGGAAATCC GCAAGCTGCG CGACCAGTAT
GACCTGCACC CGGTTTATAA GCGCGTGGAT ACCTGTGCGG CAGAGTTCGC CACCGACACC
GCTTACATGT ACTCCACTTA TGAAGAAGAG TGCGAAGCGA ATCCGTCTAC CGACCGTGAA
AAAATCATGG TTCTCGGCGG CGGCCCGAAC CGTATCGGTC AGGGTATCGA ATTCGACTAC
TGCTGTGTAC ACGCCTCGCT GGCGCTGCGC GAAGACGGTT ACGAAACCAT TATGGTTAAC
TGTAACCCGG AAACCGTCTC CACCGACTAC GACACTTCCG ACCGCCTCTA CTTCGAGCCG
GTAACCCTGG AAGATGTGCT GGAAATCGTG CGTATTGAGA AGCCGAAAGG CGTTATCGTT
CAGTACGGCG GTCAGACCCC GCTGAAACTG GCGCGTGCAC TGGAAGCCGC TGGCGTACCG
GTTATCGGCA CCAGCCCGGA TGCTATCGAC CGTGCGGAAG ACCGTGAACG CTTCCAGCAC
GCGGTTGACC GCCTGAAACT GAAGCAACCG GCGAACGCCA CCGTTACCGC TATCGAAATG
GCGGTAGAGA AGGCGAAAGA GATTGGCTAC CCGCTGGTGG TACGTCCGTC TTACGTTCTC
GGCGGTCGGG CGATGGAAAT CGTCTATGAC GAAGCAGACC TGCGTCGCTA CTTCCAGACG
GCGGTCAGCG TGTCTAACGA TGCGCCAGTG TTGCTGGACC ACTTCCTCGA TGACGCGGTA
GAAGTGGATG TGGACGCCAT CTGCGACGGC GAAATGGTGC TGATTGGCGG CATCATGGAG
CATATTGAGC AGGCGGGCGT GCACTCCGGT GACTCCGCAT GTTCGCTGCC AGCCTACACT
TTAAGTCAGG AAATTCAGGA TGTGATGCGC CAGCAGGTGC AGAAACTGGC CTTCGAATTG
CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTGAAAA ACAACGAAGT CTACCTGATT
GAAGTTAACC CGCGTGCGGC GCGTACCGTT CCGTTCGTCT CCAAAGCCAC CGGCGTACCG
CTGGCAAAAG TGGCGGCGCG CGTGATGGCT GGCAAATCGC TGGCTGAGCA GGGCGTAACC
AAAGAAGTTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGCTGCCGTT CAATAAATTC
CCGGGCGTTG ACCCGCTGTT AGGGCCAGAA ATGCGCTCTA CCGGGGAAGT CATGGGCGTG
GGCCGCACCT TCGCTGAAGC GTTTGCCAAA GCGCAGCTGG GCAGCAACTC CACCATGAAG
AAACACGGTC GTGCGCTGCT TTCCGTGCGC GAAGGCGATA AAGAACGCGT GGTGGACCTG
GCGGCAAAAC TGCTGAAACA GGGCTTCGAG CTGGATGCGA CCCACGGCAC GGCGATTGTG
CTGGGCGAAG CAGGTATCAA CCCGCGTCTG GTAAACAAGG TGCATGAAGG CCGTCCGCAC
ATTCAGGACC GTATCAAGAA TGGCGAATAT ACCTACATCA TCAACACCAC CTCAGGCCGT
CGTGCGATTG AAGACTCCCG CGTGATTCGT CGCAGTGCGC TGCAATATAA AGTGCATTAC
GACACCACCC TGAACGGCGG CTTTGCCACC GCGATGGCGC TGAATGCCGA TGCGACTGAA
AAAGTAATTT CGGTGCAGGA AATGCACGCA CAGATCAAAT AA
 
Protein sequence
MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM 
ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA
DAIDKAEDRR RFDVAMKKIG LETARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGSGGG
IAYNREEFEE ICARGLDLSP TKELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM
GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP
RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA
LTKIRRELKD AGADRIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG
ITGLNAEFLR QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT
AYMYSTYEEE CEANPSTDRE KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN
CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP
VIGTSPDAID RAEDRERFQH AVDRLKLKQP ANATVTAIEM AVEKAKEIGY PLVVRPSYVL
GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDHFLDDAV EVDVDAICDG EMVLIGGIME
HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKNNEVYLI
EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLAEQGVT KEVIPPYYSV KEVVLPFNKF
PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KHGRALLSVR EGDKERVVDL
AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTSGR
RAIEDSRVIR RSALQYKVHY DTTLNGGFAT AMALNADATE KVISVQEMHA QIK