Gene SeHA_C0071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0071 
SymbolcarB 
ID6491670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp72325 
End bp75552 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content59% 
IMG OID642740360 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_002044034 
Protein GI194447884 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAC GTACAGATAT AAAAAGCATC CTGATTCTGG GCGCGGGCCC GATTGTTATC 
GGTCAGGCGT GTGAGTTTGA CTACTCCGGC GCTCAGGCAT GTAAAGCGCT GCGCGAAGAG
GGTTACCGCG TTATTCTGGT GAACTCCAAC CCGGCCACCA TCATGACCGA CCCGGAAATG
GCCGATGCCA CCTACATCGA GCCGATTCAC TGGGAAGTGG TGCGCAAAAT CATCGAAAAA
GAGCGTCCGG ATGCGGTGCT GCCGACCATG GGCGGCCAGA CCGCGCTGAA CTGTGCGCTG
GAGCTGGAGC GTCAGGGCGT GCTCGAAGAG TTCGGTGTGA CCATGATTGG CGCCACCGCC
GACGCCATTG ATAAAGCCGA AGACCGTCGT CGCTTCGATA TCGCGATGAA GAAAATTGGT
CTCGACACCG CGCGTTCCGG TATCGCGCAC ACTATGGAAG AAGCGCTGGC GGTTGCCGCT
GACGTGGGCT TCCCGTGCAT CATCCGTCCG TCCTTTACCA TGGGCGGCAC CGGCGGCGGT
ATCGCTTACA ACCGTGAAGA GTTCGAAGAA ATCTGCGAAC GCGGTCTGGA CCTCTCGCCA
ACCAACGAGC TGCTGATTGA TGAATCGCTG ATCGGCTGGA AAGAGTACGA GATGGAAGTG
GTGCGTGATA AAAACGACAA CTGCATCATC GTCTGCTCTA TCGAAAACTT CGATGCGATG
GGTATCCACA CCGGTGACTC CATCACCGTG GCCCCGGCAC AGACGCTGAC CGACAAAGAA
TACCAAATCA TGCGTAACGC CTCGATGGCG GTACTGCGTG AAATCGGCGT CGAAACCGGC
GGTTCTAACG TCCAGTTCGC CGTGAACCCG AAAAACGGCC GTCTGATCGT TATCGAAATG
AACCCGCGCG TCTCCCGCTC CTCGGCGCTG GCGTCGAAAG CCACCGGTTT CCCGATTGCT
AAAGTGGCGG CCAAACTGGC GGTGGGTTAT ACCCTCGACG AGCTGATGAA CGACATCACC
GGTGGCCGTA CGCCGGCGTC GTTTGAGCCG TCTATTGACT ACGTTGTCAC CAAAATTCCG
CGCTTTAACT TTGAGAAATT CGCCGGTGCT AACGACCGTC TGACCACCCA GATGAAATCG
GTCGGGGAAG TGATGGCGAT TGGCCGCACC CAGCAGGAAT CCCTGCAAAA AGCGCTGCGC
GGCCTGGAAG TGGGCGCCAC CGGCTTCGAC CCGAAAGTCA GCCTCGACGA CCCGGAAGCG
CTGACCAAAA TCCGCCGCGA GCTGAAAGAC GCGGGCGCGG ATCGTATCTG GTATATCGCC
GATGCCTTCC GCGCAGGCCT CTCCGTCGAC GGCGTGTTCA ACCTGACCAA CATCGACCGC
TGGTTCCTGG TGCAAATTGA AGAGCTGGTG CGTCTGGAAG AGAAAGTAAC TGAAGTCGGG
ATTACTGGCC TCAACGCCGA CTTCCTGCGT CAGCTCAAGC GTAAAGGTTT TGCCGATGCG
CGTCTGGCAA AATTGGCGGG CGTGCGCGAG GCGGAAATCC GCAAACTGCG CGACCAGTAT
GACCTGCACC CGGTTTACAA ACGCGTGGAT ACCTGCGCGG CGGAATTCGC CACCGATACC
GCCTACATGT ACTCCACTTA TGAAGATGAG TGCGAAGCGA ACCCGTCCGT TGACCGCGAT
AAAATCATGG TCCTCGGCGG CGGCCCGAAC CGTATCGGCC AGGGTATCGA ATTTGACTAC
TGCTGCGTAC ACGCCTCGCT GGCGCTGCGC GAAGACGGTT ACGAGACCAT CATGGTCAAC
TGTAACCCGG AAACCGTCTC CACCGACTAC GACACTTCCG ACCGTCTGTA CTTCGAGCCG
GTGACGCTGG AAGACGTGCT GGAAATCGTG CGCATCGAGA AGCCGAAAGG CGTTATCGTG
CAGTACGGCG GCCAGACCCC GCTGAAGCTG GCGCGCGCGC TGGAAGCGGC AGGCGTGCCG
GTTATCGGCA CCAGCCCGGA CGCCATCGAC CGCGCGGAAG ACCGTGAACG CTTCCAGCAT
GCGGTTGACC GTCTGAAGCT GAAGCAACCG GCCAACGCCA CCGTCACCGC CATTGAACAG
GCTGTCGAAA AAGCGAAAGA GATCGGCTAC CCGCTGGTGG TGCGTCCTTC TTACGTGCTG
GGCGGCCGGG CGATGGAAAT TGTCTATGAC GAAGCCGATC TGCGTCGCTA CTTCCAGACA
GCGGTCAGCG TCTCTAACGA TGCGCCGGTG CTGCTGGACC GCTTCCTTGA TGACGCGGTT
GAAGTGGACG TGGACGCTAT CTGCGACGGC GAAATGGTGC TGATTGGCGG CATCATGGAA
CACATAGAGC AGGCGGGCGT ACACTCCGGC GACTCCGCCT GTTCCCTGCC GGCCTACACG
CTGAGCCAGG AGATTCAGGA TGTGATGCGC CAACAGGTGC AGAAGCTGGC CTTCGAGTTG
CAGGTGCGCG GCCTGATGAA CGTGCAGTTT GCGGTGAAAG ACAACGAAGT CTATCTGATT
GAAGTCAACC CGCGTGCGGC GCGTACCGTA CCGTTCGTCT CCAAAGCCAC CGGCGTTCCG
CTGGCGAAAG TGGCGGCGCG CGTGATGGCC GGCAAATCGC TGACCGAGCA GGGCGTGACC
CAAGAAATTA TCCCGCCGTA CTACTCGGTG AAAGAAGTGG TGCTGCCGTT CAACAAATTC
CCGGGCGTTG ACCCGCTGTT AGGGCCAGAA ATGCGCTCCA CCGGGGAAGT GATGGGCGTG
GGCCGTACCT TCGCGGAGGC GTTCGCTAAG GCGCAGCTGG GCAGTAACTC CACCATGAAG
AAACAGGGCC GTGCGCTGCT CTCCGTTCGC GAAGGCGACA AAGAGCGCGT GGTGGACCTG
GCCGCTAAGC TGCTGAAACA GGGCTTCGAG CTGGATGCTA CCCACGGTAC GGCGATTGTG
CTGGGCGAAG CCGGTATCAA CCCGCGTCTG GTGAACAAGG TGCACGAAGG TCGTCCGCAC
ATTCAGGACC GTATCAAGAA TGGCGAATAT ACCTACATCA TCAACACCAC CGCAGGTCGC
CGCGCGATTG AAGACTCCAG GGTGATTCGC CGCAGCGCGC TGCAGTACAA GGTGCATTAT
GACACCACGC TGAACGGCGG TTTTGCCACG ACGATGGCGC TCAATGCCGA TGCCACCGAG
AAGGTAACCT CGGTGCAGGA AATGCACGCG CAGATCAAAA AGTCGTAA
 
Protein sequence
MPKRTDIKSI LILGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPEM 
ADATYIEPIH WEVVRKIIEK ERPDAVLPTM GGQTALNCAL ELERQGVLEE FGVTMIGATA
DAIDKAEDRR RFDIAMKKIG LDTARSGIAH TMEEALAVAA DVGFPCIIRP SFTMGGTGGG
IAYNREEFEE ICERGLDLSP TNELLIDESL IGWKEYEMEV VRDKNDNCII VCSIENFDAM
GIHTGDSITV APAQTLTDKE YQIMRNASMA VLREIGVETG GSNVQFAVNP KNGRLIVIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELMNDIT GGRTPASFEP SIDYVVTKIP
RFNFEKFAGA NDRLTTQMKS VGEVMAIGRT QQESLQKALR GLEVGATGFD PKVSLDDPEA
LTKIRRELKD AGADRIWYIA DAFRAGLSVD GVFNLTNIDR WFLVQIEELV RLEEKVTEVG
ITGLNADFLR QLKRKGFADA RLAKLAGVRE AEIRKLRDQY DLHPVYKRVD TCAAEFATDT
AYMYSTYEDE CEANPSVDRD KIMVLGGGPN RIGQGIEFDY CCVHASLALR EDGYETIMVN
CNPETVSTDY DTSDRLYFEP VTLEDVLEIV RIEKPKGVIV QYGGQTPLKL ARALEAAGVP
VIGTSPDAID RAEDRERFQH AVDRLKLKQP ANATVTAIEQ AVEKAKEIGY PLVVRPSYVL
GGRAMEIVYD EADLRRYFQT AVSVSNDAPV LLDRFLDDAV EVDVDAICDG EMVLIGGIME
HIEQAGVHSG DSACSLPAYT LSQEIQDVMR QQVQKLAFEL QVRGLMNVQF AVKDNEVYLI
EVNPRAARTV PFVSKATGVP LAKVAARVMA GKSLTEQGVT QEIIPPYYSV KEVVLPFNKF
PGVDPLLGPE MRSTGEVMGV GRTFAEAFAK AQLGSNSTMK KQGRALLSVR EGDKERVVDL
AAKLLKQGFE LDATHGTAIV LGEAGINPRL VNKVHEGRPH IQDRIKNGEY TYIINTTAGR
RAIEDSRVIR RSALQYKVHY DTTLNGGFAT TMALNADATE KVTSVQEMHA QIKKS