Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A0073 |
Symbol | carB |
ID | 6871253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 76863 |
End bp | 80090 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642783328 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_002214022 |
Protein GI | 198241932 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.841404 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAC GTACAGATAT AAAAAGCATC CTGATTCTGG GCGCGGGCCC GATTGTTATC GGTCAGGCGT GTGAGTTTGA CTACTCCGGC GCTCAGGCGT GTAAAGCCCT GCGCGAAGAG GGCTATCGCG TCATTCTGGT GAACTCCAAC CCGGCCACCA TCATGACCGA CCCGGAAATG GCCGATGCCA CCTACATCGA GCCGATTCAC TGGGAAGTGG TGCGCAAAAT CATTGAAAAA GAGCGTCCGG ATGCGGTGCT GCCGACCATG GGCGGCCAGA CCGCGCTGAA CTGCGCGCTG GAGCTGGAGC GGCAGGGCGT GCTGGAAGAG TTCGGCGTCA CCATGATTGG CGCCACCGCC GACGCCATTG ATAAAGCCGA AGACCGTCGT CGCTTCGATA TCGCGATGAA GAAAATTGGT CTCGACACCG CGCGTTCCGG TATCGCACAC ACTATGGAAG AAGCGCTGGC GGTTGCCGCT GACGTGGGCT TCCCGTGCAT CATCCGTCCG TCCTTTACCA TGGGCGGCAC CGGCGGCGGT ATCGCTTACA ACCGTGAAGA GTTCGAAGAA ATCTGCGAAC GCGGTCTGGA TCTCTCGCCA ACCAACGAAC TGCTGATTGA TGAATCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG GTGCGTGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT CGATGCGATG GGTATCCACA CCGGTGACTC CATCACCGTG GCCCCGGCGC AGACTCTGAC CGACAAAGAA TACCAAATCA TGCGTAACGC CTCGATGGCG GTTCTGCGTG AAATCGGCGT CGAAACCGGC GGTTCTAACG TCCAGTTCGC AGTGAACCCG AAAAACGGCC GTCTGATCGT TATCGAAATG AACCCGCGCG TATCCCGCTC CTCGGCGCTG GCGTCGAAAG CCACCGGTTT CCCGATTGCC AAAGTGGCCG CCAAACTGGC GGTAGGTTAC ACCCTTGACG AACTGATGAA CGACATCACC GGCGGGCGCA CTCCGGCCTC CTTCGAGCCG TCCATCGACT ATGTGGTTAC TAAAATTCCG CGCTTTAACT TTGAGAAATT CGCCGGTGCC AACGACCGTC TGACCACCCA GATGAAATCG GTCGGAGAAG TGATGGCGAT TGGTCGCACC CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC GGCCTGGAAG TCGGCGCCAC CGGTTTTGAC CCGAAAGTGA GCCTCGACGA CCCGGAAGCG CTGACCAAAA TCCGCCGCGA GCTGAAAGAC GCGGGCGCAG ATCGTATCTG GTACATCGCC GATGCTTTCC GTGCGGGCTT GTCTGTGGAT GGCGTCTTTA ACCTGACCAA CATCGACCGC TGGTTCCTGG TGCAAATTGA AGAGCTGGTG CGTCTGGAAG AGAAAGTGGC TGAAGTCGGG ATTACTGGCC TCAACGCCGA CTTCCTGCTT CAGCTCAAGC GTAAAGGGTT TGCCGATGCG CGTCTGGCAA AACTGGCTGG CGTACGCGAG GCGGAAATCC GTAAGCTGCG TGATCAGTAT GACCTGCACC CGGTTTACAA ACGCGTGGAT ACCTGCGCGG CGGAATTCGC CACCGATACC GCCTACATGT ACTCCACTTA TGAAGATGAG TGCGAAGCGA ACCCGTCCGT TGACCGCGAT AAAATCATGG TCCTCGGCGG CGGCCCGAAC CGTATCGGCC AGGGTATCGA ATTTGACTAC TGCTGCGTAC ACGCCTCGCT GGCGCTGCGC GAAGACGGTT ACGAGACCAT CATGGTCAAC TGTAACCCGG AAACCGTCTC CACCGACTAC GACACTTCCG ACCGTCTGTA CTTCGAGCCG GTGACGCTGG AAGACGTGCT GGAAATCGTG CGCATCGAGA AGCCGAAAGG CGTTATCGTG CAGTACGGCG GCCAGACCCC GCTGAAGCTG GCGCGCGCGC TGGAAGCGGC AGGCGTGCCG GTTATCGGCA CCAGCCCGGA CGCCATCGAC CGCGCGGAAG ACCGCGAGCG CTTCCAGCAC GCGGTTGACC GTCTGAAGCT GAAGCAACCG GCCAACGCCA CCGTCACCGC CATTGAACAG GCTGTCGAAA AAGCGAAAGA GATCGGCTAC CCGCTGGTGG TGCGTCCTTC TTACGTGCTG GGCGGCCGGG CGATGGAAAT TGTCTATGAC GAAGCCGATC TGCGTCGCTA CTTCCAGACA GCGGTCAGCG TCTCTAACGA TGCGCCGGTG CTGCTGGACC GCTTCCTTGA TGACGCGGTT GAAGTGGACG TGGACGCTAT CTGCGACGGC GAAATGGTGC TGATTGGCGG TATCATGGAG CACATCGAGC AGGCGGGCGT ACACTCCGGC GACTCCGCCT GTTCCCTGCC GGCCTACACG CTGAGCCAGG AAATTCAGGA CGTGATGCGC CAGCAGGTGC AGAAGCTGGC CTTCGAGTTG CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTGAAAG ACAACGAAGT CTATCTGATT GAAGTCAACC CGCGTGCGGC GCGTACCGTA CCGTTCGTCT CCAAAGCCAC CGGCGTTCCG CTGGCGAAAG TGGCGGCGCG CGTGATGGCC GGCAAATCGC TGACCGAGCA GGGCGTGACC AAAGAAATTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGTTGCCGTT TAACAAATTC CCGGGCGTTG ACCCGCTGTT AGGGCCAGAA ATGCGCTCCA CCGGGGAAGT GATGGGCGTG GGCCGTACCT TCGCGGAAGC GTTCGCTAAG GCGCAGCTGG GCAGTAACTC CACCATGAAG AAACAGGGCC GTGCGCTGCT CTCGGTTCGC GAAGGCGACA AAGAGCGCGT GGTGGATCTC GCCGCTAAGC TGCTGAAACA GGGCTTTGAG CTGGATGCCA CCCACGGTAC GGCGATTGTG CTGGGCGAAG CTGGTATCAA CCCGCGTCTG GTGAACAAGG TGCACGAAGG TCGTCCGCAC ATTCAGGACC GTATCAAGAA TGGCGAATAT ACTTATATCA TCAACACCAC CGCAGGCCGC CGCGCGATTG AAGACTCCAG GGTGATTCGC CGCAGCGCGC TGCAGTATAA AGTGCATTAT GACACCACGC TGAACGGCGG TTTTGCAACG ACGATGGCGC TCAATGCCGA TGCCACCGAG AAGGTAACCT CGGTGCAGGA AATGCACGCG CAGATCAAAA AGTCGTAA
|
Protein sequence | MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA DAIDKAEDRR RFDIAMKKIG LDTARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGTGGG IAYNREEFEE ICERGLDLSP TNELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA LTKIRRELKD AGADRIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG ITGLNADFLL QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT AYMYSTYEDE CEANPSVDRD KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP VIGTSPDAID RAEDRERFQH AVDRLKLKQP ANATVTAIEQ AVEKAKEIGY PLVVRPSYVL GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDRFLDDAV EVDVDAICDG EMVLIGGIME HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKDNEVYLI EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLTEQGVT KEIIPPYYSV KEVVLPFNKF PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KQGRALLSVR EGDKERVVDL AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTAGR RAIEDSRVIR RSALQYKVHY DTTLNGGFAT TMALNADATE KVTSVQEMHA QIKKS
|
| |