Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1446 |
Symbol | carB |
ID | 5694283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1721408 |
End bp | 1724608 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641264041 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_001529327 |
Protein GI | 158521457 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAGC GGGACGACAT ACATAAAGTA ATGATCATCG GGTCCGGTCC CATCATCATC GGACAGGCCT GCGAGTTTGA CTATTCCGGC ACCCAGGCCT GCAAGGCCCT TCGCAGCCTG GGCTACACCG TTGTGCTGGT CAACTCCAAC CCGGCCACCA TCATGACGGA CCCTGGTATG GCGGACATCA CCTATATCGA GCCCCTGAAC GTGGCCACCC TGACTCGGAT CATTGAAAAA GAGCGGCCCG ACGCCCTTCT GCCCAACCTC GGAGGCCAGT CCGGGCTCAA CCTCTCTTCC GAGCTCCACC AGGCCGGCGT GCTGGACAAA TACGGGGTCA AGATCATCGG CGTTAACGTG GATGCCATAA AGCGGGGCGA GGACCGCACC GAGTTCAAGA ACACCATGGA GCGGCTGGGC ATTGAGATGG CCAGGAGCAG GACGGTCACC ACCGTGGAAG ACGCCGAAAA AGTGGCCGAG GAGATCGGTT ACCCGGTGGT GATCCGGCCG GCCTACACCA TGGGCGGCAC CGGCGGCGGG TTTGTCTACA ACGTGGAAGA ACTCCGCGTC ATCGCGGCCC GGGGGCTGGC CGCCAGCATG GTCAACCAGG TACTGGTGGA GGAGTCGGTA CTGGGCTGGG AAGAGCTGGA GCTGGAGGTG GTGCGGGACG CCAAAAACCA GAAGATCACG GTCTGCTTCA TTGAGAACGT GGATGCCATG GGGGTCCACA CCGGTGACTC CTTCTGCACG GCGCCCATGA TGACCATCTC GCCCGCGCTT CAGGAACGGC TTCAGAAGTA CTCCTATGAT ATCGTGGACG CCATCGAGGT GATCGGCGGC ACCAACGTGC AGTTTGCCCA CGACCCGGCA ACCGGCCGGG TGGTGGTCAT CGAGATCAAC CCCCGCACCT CCCGGTCGTC GGCCCTGGCC TCCAAGGCAA CGGGCTTTCC CATTGCCATG GTATCGGCCC TGCTGGCCGG GGGGCTGACC CTGGATGAGA TTCCCTACTG GCGGGATGGC ACCCTGGAAA AGTACACCCC CTCCGGGGAT TACGTGGTGG TAAAGTTTGC CAAGTGGGCT TTTGAAAAGT TTGTCGGCGC CGAAGATGTG CTGGGCACCC AGATGAAGGC CGTGGGCGAG GTGATGAGCA TCGGGAAAAA CTACAAGGAG GCCCTGCAGA AGGCGATCCG GTCCCTGGAA AACGGCCGCC ACGGACTGGG CTTTGCCAAA AACTTCAACA CGATCTCCTT AGATGACCTG ATGGCAAAGC TGCGTAAGCC CTCCAGCGAG AGGCAGTTTA TTATGTACGA GGCCCTGCGA AAAGGGGCAA CCATCGAGGC CCTGCACGGG CTGACCCACA TCAAGGCCTG GTTTATCGAG CAGATGAAGG AACTGGTGGA CCTGGAAGAG ACACTGATCA AACACCGGGG AAACCTGCCG CCGGACGACC TGTTTGTGAC GGCCAAAAAG GACGGGTTTG CCGACGCCTA CCTGTCAAAA ATTCTGGCCG TGCCCGAGAC CGAGATCCGG AAAAAGCGCC TCTCCCTGGG CCTGGCCGAG GCCTGGGAGC CGGTGCCGGT AAGCGGGGTG GAGAACGCGG CCTACTACTA CTCCACCTAC AACGCCCCGG ACCAGGTGGC GGTGTCGGAA AACAGGAAGG TCATGGTGCT GGGCGGCGGC CCCAACCGCA TCGGCCAGGG CATTGAGTTC GATTACTGCT GCGTTCACGC CGCCTTTGCC ATTCGGGATC AGGGGCTGGA ATCGATCATG GTCAACTGCA ATCCGGAAAC GGTCTCCACG GATTACGACA CATCCAATAA GCTCTATTTC GAACCCCTGA CCGTGGAGGA TGTGCTCAGC ATCTACGCAA AGGAAAAGCC CGATGGCGTG ATCGTGCAGT TCGGCGGCCA GACCCCGCTC AACCTCGCCA GGGCACTGGA AGCGGCGGGC GTCAACATCC TTGGTACCTC GCCGGACACC ATCGACCTGG CCGAGGACCG GGACCGGTTC CGTCAGGTGA TGCAGGACTT GGGCATTCCC CAGCCCGAAT CGGGCATGGC CAGCACCCTG GACCAGGCCC TGGAGATCGC GGCCCGCATT GGCTATCCGC TGATGGTGCG GCCCTCCTAT GTGCTGGGGG GCAGGGCCAT GGAGGTGGTG GCCGATGAAG AGATGCTGCG CCAGTATGTG ACGGCGGCCG TGGACGTGTC GCCGGACCGG CCCATTCTCA TCGACAAGTT CCTGGAAAAC GCCATCGAGG CCGAGGCCGA CGCTATTGCC GACGGCACCG ACGCCTTTGT GCCCGCCGTG ATGGAGCATA TCGAACTGGC CGGAGTCCAT TCCGGAGACT CGGCCTGCGT GCTGCCGCCG GTCTCCATTC CGGAAAAACA CATCAACACC ATTGTGGACT ACACGCGGAA GATCGCCATG ACCCTGAAGG TGGTGGGGCT GATGAACATT CAGTACGCCA TTGCCGACGA CTGCGTCTAT ATTCTGGAAG CCAACCCCCG GGCCTCCCGC ACCGTGCCCC TGGTCTCCAA GGTGTGCAAT ATTCCCATGG CCCGGTACGC GGCACAGATC ATGATGGGCG AGACCTTGGC CGACCTGGAT TTAAAGCCGC GCAAGGTCCG CCATTTCGGC GTCAAGGAGT CGGTTTTTCC GTTTAACATG TTTCCCGAGG TCGACCCGGT GCTGGGGCCG GAGATGCGCT CCACAGGCGA GGTGCTGGGC ATTGCAGACT CCTTCGGCTA CGCCTTTTTC AAGGCCCAGG AGGCCACCCA GGCCCCGCTG CCCACCGGCG GGGCCGTGCT GATCACCGTG GCCGACAAGG ACAAGCAGGC CATTCTGGAA ACGGCTCGCC TGTTCAGCGA TCTGGGCTTT ACCGTGCTGG CCACCCAGGG CACCGGCGAG TTTCTCTCCC GCCAGGGCAT TGCCGCCCAG GCTGTCACCA AGCTGGGCCA TGGCCGGCCC GACATCGTGG ACCTGATCAA GAACGGCGAT ATCCAACTGC TGGTCAACAC GCCGGGCGGC AAGGCCAGCA AGGAGGATGA CTCCTATATC CGCAAGGCGG CGGTCAAGTA CAAGGTGCCG TACATGACCA CCGTAGCCGC CTCCCTGGCC GCGGCCCGGG GCATTGCCGC ACGGAACCGG GGCGAAGAGC AGATCCATTC GCTTCAGGAG TACCACGCCA ACATCACCTG A
|
Protein sequence | MPKRDDIHKV MIIGSGPIII GQACEFDYSG TQACKALRSL GYTVVLVNSN PATIMTDPGM ADITYIEPLN VATLTRIIEK ERPDALLPNL GGQSGLNLSS ELHQAGVLDK YGVKIIGVNV DAIKRGEDRT EFKNTMERLG IEMARSRTVT TVEDAEKVAE EIGYPVVIRP AYTMGGTGGG FVYNVEELRV IAARGLAASM VNQVLVEESV LGWEELELEV VRDAKNQKIT VCFIENVDAM GVHTGDSFCT APMMTISPAL QERLQKYSYD IVDAIEVIGG TNVQFAHDPA TGRVVVIEIN PRTSRSSALA SKATGFPIAM VSALLAGGLT LDEIPYWRDG TLEKYTPSGD YVVVKFAKWA FEKFVGAEDV LGTQMKAVGE VMSIGKNYKE ALQKAIRSLE NGRHGLGFAK NFNTISLDDL MAKLRKPSSE RQFIMYEALR KGATIEALHG LTHIKAWFIE QMKELVDLEE TLIKHRGNLP PDDLFVTAKK DGFADAYLSK ILAVPETEIR KKRLSLGLAE AWEPVPVSGV ENAAYYYSTY NAPDQVAVSE NRKVMVLGGG PNRIGQGIEF DYCCVHAAFA IRDQGLESIM VNCNPETVST DYDTSNKLYF EPLTVEDVLS IYAKEKPDGV IVQFGGQTPL NLARALEAAG VNILGTSPDT IDLAEDRDRF RQVMQDLGIP QPESGMASTL DQALEIAARI GYPLMVRPSY VLGGRAMEVV ADEEMLRQYV TAAVDVSPDR PILIDKFLEN AIEAEADAIA DGTDAFVPAV MEHIELAGVH SGDSACVLPP VSIPEKHINT IVDYTRKIAM TLKVVGLMNI QYAIADDCVY ILEANPRASR TVPLVSKVCN IPMARYAAQI MMGETLADLD LKPRKVRHFG VKESVFPFNM FPEVDPVLGP EMRSTGEVLG IADSFGYAFF KAQEATQAPL PTGGAVLITV ADKDKQAILE TARLFSDLGF TVLATQGTGE FLSRQGIAAQ AVTKLGHGRP DIVDLIKNGD IQLLVNTPGG KASKEDDSYI RKAAVKYKVP YMTTVAASLA AARGIAARNR GEEQIHSLQE YHANIT
|
| |