Gene Dole_0059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0059 
Symbol 
ID5692873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp65937 
End bp67118 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content64% 
IMG OID641262635 
Productarginine biosynthesis bifunctional protein ArgJ 
Protein accessionYP_001527946 
Protein GI158520076 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) 
TIGRFAM ID[TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTGCC CCGGTTTTCA ATGGGCCGGT GTGTGCGCGG GCATCAAAAA CTTGAAGAAA 
AAAGACCTGG GCCTGCTGGT GTCTGACACG CCGGCCGCCG TGGCCGCGGT GTTTACCGCG
AACAGGGTCA AGGCTGCGCC CGTGCTGCTG GACATGGAGC GGGTGGCATC GGGCCGGTGC
CGGGCCATTG TGGCCAACAG CGGCAATGCC AACTGCTGCA CCGGGGATGC GGGGATGGCC
GCCGCCCTTG CCATGACAAA GGCCGTGGCC GAGGCCCTGG GCATTGATGA GGCCCTGGTG
CTGGTGGCCT CCACCGGCGT TATCGGGGCG CCCATGCCCA CGGAAACCAT CACCGGCGCC
GTGCCCGGCC TGGTAAAGGC GTTACGTCCG GACGGGCTGC CCGACTTTTC CGAAAGCATT
TTAACCACAG ACCGGTTTGC CAAAAGTGCC CTGCGAAACG TTCGCCTGGA AAACGGAACA
ACCGTCACGG TCTGCGCCAC GGCCAAGGGA GCCGGCATGA TCCGGCCGGA CATGGCCACC
ATGCTCTGCT TTGTCTGCAC CGACCTTCAG GCTGATACCG ACGCTCTTTC CGGCATGCTT
TCCGTTGCTG TGGACCGCTC CTTTAACCGC ATCACCGTGG ACGGCGACAC CAGCACCAAC
GACACGGTGT TTTTAATGGC CGGCGGCGCG TCCGGCGCGG GCCTGCAAAC GGATGCCGAC
CGGCAGGGAT TTCAACAGGC CCTGGACGAC GTGCTGACCG AGCTGGCCCG AATGATGGTG
ACGGACGGGG AGGGGGCCAC CAAGCTGGTG GAGGTGCGGG TGAAGGGGGC CAAGTCTGAC
GCCGATGCCC GGCGCGTGGC CGACACCGTG GCCAATTCCA GCCTTGTGAA AACCGCTTTT
TTCGGCCAGG ACGCCAACTG GGGCCGGATC ATGGCCGCCG CGGGCCGGGC CGGGGTGGAC
CTGTCCCCGG ACGCGGTGGA TATCTTTTTT GATGATGTGC AAATGGTGAA AAACGGCATG
GGATGCGGCC CCGAAGCCGA ACGCAAGGCG TCGGGGGTGC TCAAACAGCC GACCATCTGC
CTGGGCATTG ACCTGAACAC CGGCGGCACC GGCGCGGCAA CGGTGCTGAC CTGTGACCTG
TCTATTGAGT ATGTGAAGAT CAACGCCGAC TACCGGACAT GA
 
Protein sequence
MECPGFQWAG VCAGIKNLKK KDLGLLVSDT PAAVAAVFTA NRVKAAPVLL DMERVASGRC 
RAIVANSGNA NCCTGDAGMA AALAMTKAVA EALGIDEALV LVASTGVIGA PMPTETITGA
VPGLVKALRP DGLPDFSESI LTTDRFAKSA LRNVRLENGT TVTVCATAKG AGMIRPDMAT
MLCFVCTDLQ ADTDALSGML SVAVDRSFNR ITVDGDTSTN DTVFLMAGGA SGAGLQTDAD
RQGFQQALDD VLTELARMMV TDGEGATKLV EVRVKGAKSD ADARRVADTV ANSSLVKTAF
FGQDANWGRI MAAAGRAGVD LSPDAVDIFF DDVQMVKNGM GCGPEAERKA SGVLKQPTIC
LGIDLNTGGT GAATVLTCDL SIEYVKINAD YRT