Gene SeD_A0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0073 
SymbolcarB 
ID6871253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp76863 
End bp80090 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content59% 
IMG OID642783328 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_002214022 
Protein GI198241932 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.841404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAC GTACAGATAT AAAAAGCATC CTGATTCTGG GCGCGGGCCC GATTGTTATC 
GGTCAGGCGT GTGAGTTTGA CTACTCCGGC GCTCAGGCGT GTAAAGCCCT GCGCGAAGAG
GGCTATCGCG TCATTCTGGT GAACTCCAAC CCGGCCACCA TCATGACCGA CCCGGAAATG
GCCGATGCCA CCTACATCGA GCCGATTCAC TGGGAAGTGG TGCGCAAAAT CATTGAAAAA
GAGCGTCCGG ATGCGGTGCT GCCGACCATG GGCGGCCAGA CCGCGCTGAA CTGCGCGCTG
GAGCTGGAGC GGCAGGGCGT GCTGGAAGAG TTCGGCGTCA CCATGATTGG CGCCACCGCC
GACGCCATTG ATAAAGCCGA AGACCGTCGT CGCTTCGATA TCGCGATGAA GAAAATTGGT
CTCGACACCG CGCGTTCCGG TATCGCACAC ACTATGGAAG AAGCGCTGGC GGTTGCCGCT
GACGTGGGCT TCCCGTGCAT CATCCGTCCG TCCTTTACCA TGGGCGGCAC CGGCGGCGGT
ATCGCTTACA ACCGTGAAGA GTTCGAAGAA ATCTGCGAAC GCGGTCTGGA TCTCTCGCCA
ACCAACGAAC TGCTGATTGA TGAATCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG
GTGCGTGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT CGATGCGATG
GGTATCCACA CCGGTGACTC CATCACCGTG GCCCCGGCGC AGACTCTGAC CGACAAAGAA
TACCAAATCA TGCGTAACGC CTCGATGGCG GTTCTGCGTG AAATCGGCGT CGAAACCGGC
GGTTCTAACG TCCAGTTCGC AGTGAACCCG AAAAACGGCC GTCTGATCGT TATCGAAATG
AACCCGCGCG TATCCCGCTC CTCGGCGCTG GCGTCGAAAG CCACCGGTTT CCCGATTGCC
AAAGTGGCCG CCAAACTGGC GGTAGGTTAC ACCCTTGACG AACTGATGAA CGACATCACC
GGCGGGCGCA CTCCGGCCTC CTTCGAGCCG TCCATCGACT ATGTGGTTAC TAAAATTCCG
CGCTTTAACT TTGAGAAATT CGCCGGTGCC AACGACCGTC TGACCACCCA GATGAAATCG
GTCGGAGAAG TGATGGCGAT TGGTCGCACC CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC
GGCCTGGAAG TCGGCGCCAC CGGTTTTGAC CCGAAAGTGA GCCTCGACGA CCCGGAAGCG
CTGACCAAAA TCCGCCGCGA GCTGAAAGAC GCGGGCGCAG ATCGTATCTG GTACATCGCC
GATGCTTTCC GTGCGGGCTT GTCTGTGGAT GGCGTCTTTA ACCTGACCAA CATCGACCGC
TGGTTCCTGG TGCAAATTGA AGAGCTGGTG CGTCTGGAAG AGAAAGTGGC TGAAGTCGGG
ATTACTGGCC TCAACGCCGA CTTCCTGCTT CAGCTCAAGC GTAAAGGGTT TGCCGATGCG
CGTCTGGCAA AACTGGCTGG CGTACGCGAG GCGGAAATCC GTAAGCTGCG TGATCAGTAT
GACCTGCACC CGGTTTACAA ACGCGTGGAT ACCTGCGCGG CGGAATTCGC CACCGATACC
GCCTACATGT ACTCCACTTA TGAAGATGAG TGCGAAGCGA ACCCGTCCGT TGACCGCGAT
AAAATCATGG TCCTCGGCGG CGGCCCGAAC CGTATCGGCC AGGGTATCGA ATTTGACTAC
TGCTGCGTAC ACGCCTCGCT GGCGCTGCGC GAAGACGGTT ACGAGACCAT CATGGTCAAC
TGTAACCCGG AAACCGTCTC CACCGACTAC GACACTTCCG ACCGTCTGTA CTTCGAGCCG
GTGACGCTGG AAGACGTGCT GGAAATCGTG CGCATCGAGA AGCCGAAAGG CGTTATCGTG
CAGTACGGCG GCCAGACCCC GCTGAAGCTG GCGCGCGCGC TGGAAGCGGC AGGCGTGCCG
GTTATCGGCA CCAGCCCGGA CGCCATCGAC CGCGCGGAAG ACCGCGAGCG CTTCCAGCAC
GCGGTTGACC GTCTGAAGCT GAAGCAACCG GCCAACGCCA CCGTCACCGC CATTGAACAG
GCTGTCGAAA AAGCGAAAGA GATCGGCTAC CCGCTGGTGG TGCGTCCTTC TTACGTGCTG
GGCGGCCGGG CGATGGAAAT TGTCTATGAC GAAGCCGATC TGCGTCGCTA CTTCCAGACA
GCGGTCAGCG TCTCTAACGA TGCGCCGGTG CTGCTGGACC GCTTCCTTGA TGACGCGGTT
GAAGTGGACG TGGACGCTAT CTGCGACGGC GAAATGGTGC TGATTGGCGG TATCATGGAG
CACATCGAGC AGGCGGGCGT ACACTCCGGC GACTCCGCCT GTTCCCTGCC GGCCTACACG
CTGAGCCAGG AAATTCAGGA CGTGATGCGC CAGCAGGTGC AGAAGCTGGC CTTCGAGTTG
CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTGAAAG ACAACGAAGT CTATCTGATT
GAAGTCAACC CGCGTGCGGC GCGTACCGTA CCGTTCGTCT CCAAAGCCAC CGGCGTTCCG
CTGGCGAAAG TGGCGGCGCG CGTGATGGCC GGCAAATCGC TGACCGAGCA GGGCGTGACC
AAAGAAATTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGTTGCCGTT TAACAAATTC
CCGGGCGTTG ACCCGCTGTT AGGGCCAGAA ATGCGCTCCA CCGGGGAAGT GATGGGCGTG
GGCCGTACCT TCGCGGAAGC GTTCGCTAAG GCGCAGCTGG GCAGTAACTC CACCATGAAG
AAACAGGGCC GTGCGCTGCT CTCGGTTCGC GAAGGCGACA AAGAGCGCGT GGTGGATCTC
GCCGCTAAGC TGCTGAAACA GGGCTTTGAG CTGGATGCCA CCCACGGTAC GGCGATTGTG
CTGGGCGAAG CTGGTATCAA CCCGCGTCTG GTGAACAAGG TGCACGAAGG TCGTCCGCAC
ATTCAGGACC GTATCAAGAA TGGCGAATAT ACTTATATCA TCAACACCAC CGCAGGCCGC
CGCGCGATTG AAGACTCCAG GGTGATTCGC CGCAGCGCGC TGCAGTATAA AGTGCATTAT
GACACCACGC TGAACGGCGG TTTTGCAACG ACGATGGCGC TCAATGCCGA TGCCACCGAG
AAGGTAACCT CGGTGCAGGA AATGCACGCG CAGATCAAAA AGTCGTAA
 
Protein sequence
MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM 
ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA
DAIDKAEDRR RFDIAMKKIG LDTARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGTGGG
IAYNREEFEE ICERGLDLSP TNELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM
GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP
RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA
LTKIRRELKD AGADRIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVAEVG
ITGLNADFLL QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT
AYMYSTYEDE CEANPSVDRD KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN
CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP
VIGTSPDAID RAEDRERFQH AVDRLKLKQP ANATVTAIEQ AVEKAKEIGY PLVVRPSYVL
GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDRFLDDAV EVDVDAICDG EMVLIGGIME
HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKDNEVYLI
EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLTEQGVT KEIIPPYYSV KEVVLPFNKF
PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KQGRALLSVR EGDKERVVDL
AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTAGR
RAIEDSRVIR RSALQYKVHY DTTLNGGFAT TMALNADATE KVTSVQEMHA QIKKS