Gene Dole_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0120 
Symbol 
ID5692935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp131723 
End bp132865 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content61% 
IMG OID641262697 
Productcarbamoyl-phosphate synthase, small subunit 
Protein accessionYP_001528007 
Protein GI158520137 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCCC TGCTTGCCCT TGAAGACGGG CGGACCTTTG CCTGCAAAAG CTTTACCGGA 
CCCGGAGAAA CCGGCGGGGA GATCGTGTTT AACACCGGCA TGTCCGGCTA CCAGGAGGTA
CTTACCGATC CCTCCTACCG GGGCCAGATC GTGACTATGA CCTATCCCCT TATCGGCAAC
TACGGGGTCA ACCACCAGGA TGTGGAGTCG GACCGGGTCC AGGTGGCCGG TTTTGTGGTG
CGCGAATACC AGGACTGCCC CAGCAACTTC CGGTCCGAGC AGTCCCTGGC AAAATACCTG
CAAAGCCAGG GCGTGCTGGG CATCACCGAC CTGGACACCC GTGCCCTGAC CCGCCACCTT
CGCACAGTAG GGGCGTTGCG GGCCTGCATC TCCACCCACG AACTCGATCC CGCGGCCCTG
GTAGAAAAGG CGCTTGCCGT GCCGTCCATG GCAGGGTGCG ACCTTGTCAC CGGCGTGTTT
TCAAAAAAGC CCTATCGCTG GATCAACGGG GCGCCGGCCG CCGTGGACAT GGACCTGGCC
GATATGGATG AGCGGGTGTG GCATAAAACC GGTGGCTTTC GGGTGGCGGC CTTTGATTTC
GGCATCAAGT ACAACATTCT GAGAAATCTT GAGGCGGCCG GGTTCCAGGT GCTGGTGGTG
CCTGCCGGTG CCACGGCCGC CCAGGTAAAA CAGGTCAACC CGGACGGGAT TTTTCTGTCC
AACGGGCCGG GCGACCCCGA GCCGTTGACC GGGCCGGTGG CCACCATTCG CGAGCTTCTG
GACTACCGGC CCATGTTCGG CATCTGCCTG GGAAACCAGT TGGCCGGCCT GGCCCTGGGC
GGCGCCACCT ACAAGCTCAA GTTCGGCCAC CGGGGCGCCA ACCAGCCGGT AAAGGACCTT
GAAACCGGAA AAATCGAAAT CACCTCCCAG AACCACGGGT TTGCCGTGGA TATCGACAGC
CTGAAAAAAG AAAATCTCGT GGTGACCCAC ATCAACCTCA ACGACAACAC CCTGGAGGGG
TTTGCCCACA AAGACATTCC CCTGTTTACC GTCCAGTACC ATCCCGAGGC ATCCCCGGGT
CCCCACGACG CCCGGTACCT GTTTGACCGG TTCAAAGCCC TGATAGAGAA AACCCATGCC
TAA
 
Protein sequence
MKALLALEDG RTFACKSFTG PGETGGEIVF NTGMSGYQEV LTDPSYRGQI VTMTYPLIGN 
YGVNHQDVES DRVQVAGFVV REYQDCPSNF RSEQSLAKYL QSQGVLGITD LDTRALTRHL
RTVGALRACI STHELDPAAL VEKALAVPSM AGCDLVTGVF SKKPYRWING APAAVDMDLA
DMDERVWHKT GGFRVAAFDF GIKYNILRNL EAAGFQVLVV PAGATAAQVK QVNPDGIFLS
NGPGDPEPLT GPVATIRELL DYRPMFGICL GNQLAGLALG GATYKLKFGH RGANQPVKDL
ETGKIEITSQ NHGFAVDIDS LKKENLVVTH INLNDNTLEG FAHKDIPLFT VQYHPEASPG
PHDARYLFDR FKALIEKTHA