Gene Dole_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1446 
SymbolcarB 
ID5694283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1721408 
End bp1724608 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content62% 
IMG OID641264041 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001529327 
Protein GI158521457 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAGC GGGACGACAT ACATAAAGTA ATGATCATCG GGTCCGGTCC CATCATCATC 
GGACAGGCCT GCGAGTTTGA CTATTCCGGC ACCCAGGCCT GCAAGGCCCT TCGCAGCCTG
GGCTACACCG TTGTGCTGGT CAACTCCAAC CCGGCCACCA TCATGACGGA CCCTGGTATG
GCGGACATCA CCTATATCGA GCCCCTGAAC GTGGCCACCC TGACTCGGAT CATTGAAAAA
GAGCGGCCCG ACGCCCTTCT GCCCAACCTC GGAGGCCAGT CCGGGCTCAA CCTCTCTTCC
GAGCTCCACC AGGCCGGCGT GCTGGACAAA TACGGGGTCA AGATCATCGG CGTTAACGTG
GATGCCATAA AGCGGGGCGA GGACCGCACC GAGTTCAAGA ACACCATGGA GCGGCTGGGC
ATTGAGATGG CCAGGAGCAG GACGGTCACC ACCGTGGAAG ACGCCGAAAA AGTGGCCGAG
GAGATCGGTT ACCCGGTGGT GATCCGGCCG GCCTACACCA TGGGCGGCAC CGGCGGCGGG
TTTGTCTACA ACGTGGAAGA ACTCCGCGTC ATCGCGGCCC GGGGGCTGGC CGCCAGCATG
GTCAACCAGG TACTGGTGGA GGAGTCGGTA CTGGGCTGGG AAGAGCTGGA GCTGGAGGTG
GTGCGGGACG CCAAAAACCA GAAGATCACG GTCTGCTTCA TTGAGAACGT GGATGCCATG
GGGGTCCACA CCGGTGACTC CTTCTGCACG GCGCCCATGA TGACCATCTC GCCCGCGCTT
CAGGAACGGC TTCAGAAGTA CTCCTATGAT ATCGTGGACG CCATCGAGGT GATCGGCGGC
ACCAACGTGC AGTTTGCCCA CGACCCGGCA ACCGGCCGGG TGGTGGTCAT CGAGATCAAC
CCCCGCACCT CCCGGTCGTC GGCCCTGGCC TCCAAGGCAA CGGGCTTTCC CATTGCCATG
GTATCGGCCC TGCTGGCCGG GGGGCTGACC CTGGATGAGA TTCCCTACTG GCGGGATGGC
ACCCTGGAAA AGTACACCCC CTCCGGGGAT TACGTGGTGG TAAAGTTTGC CAAGTGGGCT
TTTGAAAAGT TTGTCGGCGC CGAAGATGTG CTGGGCACCC AGATGAAGGC CGTGGGCGAG
GTGATGAGCA TCGGGAAAAA CTACAAGGAG GCCCTGCAGA AGGCGATCCG GTCCCTGGAA
AACGGCCGCC ACGGACTGGG CTTTGCCAAA AACTTCAACA CGATCTCCTT AGATGACCTG
ATGGCAAAGC TGCGTAAGCC CTCCAGCGAG AGGCAGTTTA TTATGTACGA GGCCCTGCGA
AAAGGGGCAA CCATCGAGGC CCTGCACGGG CTGACCCACA TCAAGGCCTG GTTTATCGAG
CAGATGAAGG AACTGGTGGA CCTGGAAGAG ACACTGATCA AACACCGGGG AAACCTGCCG
CCGGACGACC TGTTTGTGAC GGCCAAAAAG GACGGGTTTG CCGACGCCTA CCTGTCAAAA
ATTCTGGCCG TGCCCGAGAC CGAGATCCGG AAAAAGCGCC TCTCCCTGGG CCTGGCCGAG
GCCTGGGAGC CGGTGCCGGT AAGCGGGGTG GAGAACGCGG CCTACTACTA CTCCACCTAC
AACGCCCCGG ACCAGGTGGC GGTGTCGGAA AACAGGAAGG TCATGGTGCT GGGCGGCGGC
CCCAACCGCA TCGGCCAGGG CATTGAGTTC GATTACTGCT GCGTTCACGC CGCCTTTGCC
ATTCGGGATC AGGGGCTGGA ATCGATCATG GTCAACTGCA ATCCGGAAAC GGTCTCCACG
GATTACGACA CATCCAATAA GCTCTATTTC GAACCCCTGA CCGTGGAGGA TGTGCTCAGC
ATCTACGCAA AGGAAAAGCC CGATGGCGTG ATCGTGCAGT TCGGCGGCCA GACCCCGCTC
AACCTCGCCA GGGCACTGGA AGCGGCGGGC GTCAACATCC TTGGTACCTC GCCGGACACC
ATCGACCTGG CCGAGGACCG GGACCGGTTC CGTCAGGTGA TGCAGGACTT GGGCATTCCC
CAGCCCGAAT CGGGCATGGC CAGCACCCTG GACCAGGCCC TGGAGATCGC GGCCCGCATT
GGCTATCCGC TGATGGTGCG GCCCTCCTAT GTGCTGGGGG GCAGGGCCAT GGAGGTGGTG
GCCGATGAAG AGATGCTGCG CCAGTATGTG ACGGCGGCCG TGGACGTGTC GCCGGACCGG
CCCATTCTCA TCGACAAGTT CCTGGAAAAC GCCATCGAGG CCGAGGCCGA CGCTATTGCC
GACGGCACCG ACGCCTTTGT GCCCGCCGTG ATGGAGCATA TCGAACTGGC CGGAGTCCAT
TCCGGAGACT CGGCCTGCGT GCTGCCGCCG GTCTCCATTC CGGAAAAACA CATCAACACC
ATTGTGGACT ACACGCGGAA GATCGCCATG ACCCTGAAGG TGGTGGGGCT GATGAACATT
CAGTACGCCA TTGCCGACGA CTGCGTCTAT ATTCTGGAAG CCAACCCCCG GGCCTCCCGC
ACCGTGCCCC TGGTCTCCAA GGTGTGCAAT ATTCCCATGG CCCGGTACGC GGCACAGATC
ATGATGGGCG AGACCTTGGC CGACCTGGAT TTAAAGCCGC GCAAGGTCCG CCATTTCGGC
GTCAAGGAGT CGGTTTTTCC GTTTAACATG TTTCCCGAGG TCGACCCGGT GCTGGGGCCG
GAGATGCGCT CCACAGGCGA GGTGCTGGGC ATTGCAGACT CCTTCGGCTA CGCCTTTTTC
AAGGCCCAGG AGGCCACCCA GGCCCCGCTG CCCACCGGCG GGGCCGTGCT GATCACCGTG
GCCGACAAGG ACAAGCAGGC CATTCTGGAA ACGGCTCGCC TGTTCAGCGA TCTGGGCTTT
ACCGTGCTGG CCACCCAGGG CACCGGCGAG TTTCTCTCCC GCCAGGGCAT TGCCGCCCAG
GCTGTCACCA AGCTGGGCCA TGGCCGGCCC GACATCGTGG ACCTGATCAA GAACGGCGAT
ATCCAACTGC TGGTCAACAC GCCGGGCGGC AAGGCCAGCA AGGAGGATGA CTCCTATATC
CGCAAGGCGG CGGTCAAGTA CAAGGTGCCG TACATGACCA CCGTAGCCGC CTCCCTGGCC
GCGGCCCGGG GCATTGCCGC ACGGAACCGG GGCGAAGAGC AGATCCATTC GCTTCAGGAG
TACCACGCCA ACATCACCTG A
 
Protein sequence
MPKRDDIHKV MIIGSGPIII GQACEFDYSG TQACKALRSL GYTVVLVNSN PATIMTDPGM 
ADITYIEPLN VATLTRIIEK ERPDALLPNL GGQSGLNLSS ELHQAGVLDK YGVKIIGVNV
DAIKRGEDRT EFKNTMERLG IEMARSRTVT TVEDAEKVAE EIGYPVVIRP AYTMGGTGGG
FVYNVEELRV IAARGLAASM VNQVLVEESV LGWEELELEV VRDAKNQKIT VCFIENVDAM
GVHTGDSFCT APMMTISPAL QERLQKYSYD IVDAIEVIGG TNVQFAHDPA TGRVVVIEIN
PRTSRSSALA SKATGFPIAM VSALLAGGLT LDEIPYWRDG TLEKYTPSGD YVVVKFAKWA
FEKFVGAEDV LGTQMKAVGE VMSIGKNYKE ALQKAIRSLE NGRHGLGFAK NFNTISLDDL
MAKLRKPSSE RQFIMYEALR KGATIEALHG LTHIKAWFIE QMKELVDLEE TLIKHRGNLP
PDDLFVTAKK DGFADAYLSK ILAVPETEIR KKRLSLGLAE AWEPVPVSGV ENAAYYYSTY
NAPDQVAVSE NRKVMVLGGG PNRIGQGIEF DYCCVHAAFA IRDQGLESIM VNCNPETVST
DYDTSNKLYF EPLTVEDVLS IYAKEKPDGV IVQFGGQTPL NLARALEAAG VNILGTSPDT
IDLAEDRDRF RQVMQDLGIP QPESGMASTL DQALEIAARI GYPLMVRPSY VLGGRAMEVV
ADEEMLRQYV TAAVDVSPDR PILIDKFLEN AIEAEADAIA DGTDAFVPAV MEHIELAGVH
SGDSACVLPP VSIPEKHINT IVDYTRKIAM TLKVVGLMNI QYAIADDCVY ILEANPRASR
TVPLVSKVCN IPMARYAAQI MMGETLADLD LKPRKVRHFG VKESVFPFNM FPEVDPVLGP
EMRSTGEVLG IADSFGYAFF KAQEATQAPL PTGGAVLITV ADKDKQAILE TARLFSDLGF
TVLATQGTGE FLSRQGIAAQ AVTKLGHGRP DIVDLIKNGD IQLLVNTPGG KASKEDDSYI
RKAAVKYKVP YMTTVAASLA AARGIAARNR GEEQIHSLQE YHANIT