Gene EcDH1_3566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3566 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3837339 
End bp3840560 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content56% 
IMG OID 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionACX41180 
Protein GI260450758 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAC GTACAGATAT AAAAAGTATC CTGATTCTGG GTGCGGGCCC GATTGTTATC 
GGTCAGGCGT GTGAGTTTGA CTACTCTGGC GCGCAAGCGT GTAAAGCCCT GCGTGAAGAG
GGTTACCGCG TCATTCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAAATG
GCTGATGCAA CCTACATCGA GCCGATTCAC TGGGAAGTTG TACGCAAGAT TATTGAAAAA
GAGCGCCCGG ACGCGGTGCT GCCAACGATG GGCGGTCAGA CGGCGCTGAA CTGCGCGCTG
GAGCTGGAAC GTCAGGGCGT GTTGGAAGAG TTCGGTGTCA CCATGATTGG TGCCACTGCC
GATGCGATTG ATAAAGCAGA AGACCGCCGT CGTTTCGACG TAGCGATGAA GAAAATTGGT
CTGGAAACCG CGCGTTCCGG TATCGCACAC ACGATGGAAG AAGCGCTGGC GGTTGCCGCT
GACGTGGGCT TCCCGTGCAT TATTCGCCCA TCCTTTACCA TGGGCGGTAG CGGCGGCGGT
ATCGCTTATA ACCGTGAAGA GTTTGAAGAA ATTTGCGCCC GCGGTCTGGA TCTCTCTCCG
ACCAAAGAGT TGCTGATTGA TGAGTCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG
GTGCGTGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT CGATGCGATG
GGCATCCACA CCGGTGACTC CATCACTGTC GCGCCAGCCC AAACGCTGAC CGACAAAGAA
TATCAAATCA TGCGTAACGC CTCGATGGCG GTGCTGCGTG AAATCGGCGT TGAAACCGGT
GGTTCCAACG TTCAGTTTGC GGTGAATCCG AAAAACGGTC GTCTGATTGT TATCGAAATG
AACCCACGCG TGTCCCGTTC TTCGGCGCTG GCGTCGAAAG CGACCGGTTT CCCGATTGCT
AAAGTGGCGG CGAAACTGGC GGTGGGTTAC ACCCTCGACG AACTGATGAA CGACATCACT
GGCGGACGTA CTCCGGCCTC CTTCGAGCCG TCCATCGACT ATGTGGTTAC TAAAATTCCT
CGCTTCAACT TCGAAAAATT CGCCGGTGCT AACGACCGTC TGACCACTCA GATGAAATCG
GTTGGCGAAG TGATGGCGAT TGGTCGCACG CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC
GGCCTGGAAG TCGGTGCGAC TGGATTCGAC CCGAAAGTGA GCCTGGATGA CCCGGAAGCG
TTAACCAAAA TCCGTCGCGA ACTGAAAGAC GCAGGCGCAG ATCGTATCTG GTACATCGCC
GATGCGTTCC GTGCGGGCCT GTCTGTGGAC GGCGTCTTCA ACCTGACCAA CATTGACCGC
TGGTTCCTGG TACAGATTGA AGAGCTGGTG CGTCTGGAAG AGAAAGTGGC GGAAGTGGGC
ATCACTGGCC TGAACGCTGA CTTCCTGCGC CAGCTGAAAC GCAAAGGCTT TGCCGATGCG
CGCTTGGCAA AACTGGCGGG CGTACGCGAA GCGGAAATCC GTAAGCTGCG TGACCAGTAT
GACCTGCACC CGGTTTATAA GCGCGTGGAT ACCTGTGCGG CAGAGTTCGC CACCGACACC
GCTTACATGT ACTCCACTTA TGAAGAAGAG TGCGAAGCGA ATCCGTCTAC CGACCGTGAA
AAAATCATGG TGCTTGGCGG CGGCCCGAAC CGTATCGGTC AGGGTATCGA ATTCGACTAC
TGTTGCGTAC ACGCCTCGCT GGCGCTGCGC GAAGACGGTT ACGAAACCAT TATGGTTAAC
TGTAACCCGG AAACCGTCTC CACCGACTAC GACACTTCCG ACCGCCTCTA CTTCGAGCCG
GTAACTCTGG AAGATGTGCT GGAAATCGTG CGTATCGAGA AGCCGAAAGG CGTTATCGTC
CAGTACGGCG GTCAGACCCC GCTGAAACTG GCGCGCGCGC TGGAAGCTGC TGGCGTACCG
GTTATCGGCA CCAGCCCGGA TGCTATCGAC CGTGCAGAAG ACCGTGAACG CTTCCAGCAT
GCGGTTGAGC GTCTGAAACT GAAACAACCG GCGAACGCCA CCGTTACCGC TATTGAAATG
GCGGTAGAGA AGGCGAAAGA GATTGGCTAC CCGCTGGTGG TACGTCCGTC TTACGTTCTC
GGCGGTCGGG CGATGGAAAT CGTCTATGAC GAAGCTGACC TGCGTCGCTA CTTCCAGACG
GCGGTCAGCG TGTCTAACGA TGCGCCAGTG TTGCTGGACC ACTTCCTCGA TGACGCGGTA
GAAGTTGACG TGGATGCCAT CTGCGACGGC GAAATGGTGC TGATTGGCGG CATCATGGAG
CATATTGAGC AGGCGGGCGT GCACTCCGGT GACTCCGCAT GTTCTCTGCC AGCCTACACC
TTAAGTCAGG AAATTCAGGA TGTGATGCGC CAGCAGGTGC AGAAACTGGC CTTCGAATTG
CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTGAAAA ACAACGAAGT CTACCTGATT
GAAGTTAACC CGCGTGCGGC GCGTACCGTT CCGTTCGTCT CCAAAGCCAC CGGCGTACCG
CTGGCAAAAG TGGCGGCGCG CGTGATGGCT GGCAAATCGC TGGCTGAGCA GGGCGTAACC
AAAGAAGTTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGCTGCCGTT CAATAAATTC
CCGGGCGTTG ACCCGCTGTT AGGGCCAGAA ATGCGCTCTA CCGGGGAAGT CATGGGCGTG
GGCCGCACCT TCGCTGAAGC GTTTGCCAAA GCGCAGCTGG GCAGCAACTC CACCATGAAG
AAACACGGTC GTGCGCTGCT TTCCGTGCGC GAAGGCGATA AAGAACGCGT GGTGGACCTG
GCGGCAAAAC TGCTGAAACA GGGCTTCGAG CTGGATGCGA CCCACGGCAC GGCGATTGTG
CTGGGCGAAG CAGGTATCAA CCCGCGTCTG GTAAACAAGG TGCATGAAGG CCGTCCGCAC
ATTCAGGACC GTATCAAGAA TGGCGAATAT ACCTACATCA TCAACACCAC CTCAGGCCGT
CGTGCGATTG AAGACTCCCG CGTGATTCGT CGCAGTGCGC TGCAATATAA AGTGCATTAC
GACACCACCC TGAACGGCGG CTTTGCCACC GCGATGGCGC TGAATGCCGA TGCGACTGAA
AAAGTAATTT CGGTGCAGGA AATGCACGCA CAGATCAAAT AA
 
Protein sequence
MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM 
ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA
DAIDKAEDRR RFDVAMKKIG LETARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGSGGG
IAYNREEFEE ICARGLDLSP TKELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM
GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP
RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA
LTKIRRELKD AGADRIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG
ITGLNADFLR QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT
AYMYSTYEEE CEANPSTDRE KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN
CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP
VIGTSPDAID RAEDRERFQH AVERLKLKQP ANATVTAIEM AVEKAKEIGY PLVVRPSYVL
GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDHFLDDAV EVDVDAICDG EMVLIGGIME
HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKNNEVYLI
EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLAEQGVT KEVIPPYYSV KEVVLPFNKF
PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KHGRALLSVR EGDKERVVDL
AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTSGR
RAIEDSRVIR RSALQYKVHY DTTLNGGFAT AMALNADATE KVISVQEMHA QIK