Gene Dole_2226 details

Gene Information

Locus tag: Dole_2226
Symbol:
ID: 5695073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2698194
End bp: 2699621
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 61%
IMG OID: 641264831
Product: putative aminopeptidase 1
Protein accession: YP_001530107
Protein GI: 158522237
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1362] Aspartyl aminopeptidase
TIGRFAM ID:
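
The start, end, gene length, and protein length fields above can be cross-checked against each other. The sketch below assumes that both coordinates are inclusive and that the unspliced CDS ends in a single stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2698194
END_BP = 2699621
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the final codon is the stop codon


print(span_length(START_BP, END_BP))     # 1428
print(coding_length_aa(GENE_LENGTH_BP))  # 475
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.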


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGGTC AAATGAGCAA AAAAGAGCTG GACGCCTTTT CAAAGAAGAC TATTCGGAAG 
CCCGCCCTGG TGTGGGACGC GCTTTCTCCT GCCGAGACCC GGCAGTCCTT TGATTTTGCT
GAAAACTACA AGCGGTTTCT GGACGATGCC AAAACCGAGC GAAAGGCTGT GACCGTTATT
CAGAAGGCCC TGGCCGCCGC CGGCTTTGTG GACATTGACG GCCGGGCCAA AGGCAAGGGG
AAATTCTACA AGGTGTTTCG CAACAAGGCG GTGGCCGCCG CGGTTGTGGG CAGCGCCCCC
CTGGACCAGG GCATGCGGAT CATCGCGGCC CACGTGGACT CGCCCCGACT GGACCTCAAG
CAGAACCCCC TTTACGAAGA GGTGGACCTG GCCATGCTCA AGGTCCACTA CTACGGCGGC
ATTCGCAAAT ACCAGTGGCT GGCCCGGCCC CTGGCCCTTT ACGGAACAGT GGTGGGCAAA
GACGGCCGGT CCTTTGATGT GGAGATCGGC GAGGCGGAAA CTGATCCGGT GATCACCATT
GCCGACCTGC TGCCCCACCT GGCGGCCAAA CTGCAGAACA GCAAAAAATT GTCTGATGTG
TTTGAGGCCG AAAAACTCAA CCTGGTGGCC GGCAGCCTGC CCGCCGGTGA CGAGAAGCAG
AAGGACCGGT TCAAGCTTAC CGTGCTCAAA TACCTGTTTG ACCGGTACGG CCTGGTGGAG
GAGGATTTTG CCAGCGCCGA GCTGGAGGCG GTGCCTGCTG GAAGGGCCAG GGACGTGGGG
TTTGACCGCG GCCTGATCGG GGCCTACGGC CAGGATGACC GGGTTTGCGC CTACACGGCC
CTGGCCGCGA TTCAGGACCT GAAGAAGCCG CCCCGGACGG CCGTGGCCCT GTTTTTCGAC
AAGGAGGAGA TCGGCAGCGA AGGCAACTCC GGCGCCCGGT CCCGGTTCAT GGAGGACTTT
ATCGCCGACC TGTTTGAAAA ACAGGACGCA CCGGTTTCCG AACGGGTGCT GCGAAAGGCC
ATCACCGCTT CAGAGGCCCT TTCCGCGGAC GTGAACGCGG CCCTGGACCC GGACTACCAG
GAGGTCCATG AAAAGCGCAA CGCGGCCCGT CTGGGATACG GTATCTGCAT CACCAAGTTC
ACCGGTTCCG GTGGCAAGTC CGGGTCCAGT GACGCCAGTG CCGAATACGT GGGCCGGGTA
CGGCAGATAT TCAACCGGGC CGGCATCGTG TGGCAGACCG GTGAGCTGGG CCGGGTGGAC
CAGGGCGGCG GCGGCACCCT GGCCAAGTTC CTGGCCGCCT ACGGCATGGA TATCGTGGAT
TGCGGCCCGG CCCTGCTCTC CATGCACTCA CCCTTTGAAC TCTCCAGCAA GGCCGATGTG
TACATGACCT TCAAAGCCTT CAAAGCGTTT TTCGATGACC GGCAGTAA
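
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be stripped before a figure such as GC content can be computed. The following is a minimal sketch with hypothetical helper names; it is run on the first sequence line only, so its result is not expected to match the 61% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as printed (with block spacing).
first_line = "ATGACAGGTC AAATGAGCAA AAAAGAGCTG GACGCCTTTT CAAAGAAGAC TATTCGGAAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing all sequence lines joined together to `clean_sequence` would give the value to compare against the GC content field.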
 
Protein sequence
MTGQMSKKEL DAFSKKTIRK PALVWDALSP AETRQSFDFA ENYKRFLDDA KTERKAVTVI 
QKALAAAGFV DIDGRAKGKG KFYKVFRNKA VAAAVVGSAP LDQGMRIIAA HVDSPRLDLK
QNPLYEEVDL AMLKVHYYGG IRKYQWLARP LALYGTVVGK DGRSFDVEIG EAETDPVITI
ADLLPHLAAK LQNSKKLSDV FEAEKLNLVA GSLPAGDEKQ KDRFKLTVLK YLFDRYGLVE
EDFASAELEA VPAGRARDVG FDRGLIGAYG QDDRVCAYTA LAAIQDLKKP PRTAVALFFD
KEEIGSEGNS GARSRFMEDF IADLFEKQDA PVSERVLRKA ITASEALSAD VNAALDPDYQ
EVHEKRNAAR LGYGICITKF TGSGGKSGSS DASAEYVGRV RQIFNRAGIV WQTGELGRVD
QGGGGTLAKF LAAYGMDIVD CGPALLSMHS PFELSSKADV YMTFKAFKAF FDDRQ
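
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), and since this gene begins with ATG that difference is ignored here. The names `CODON_TABLE` and `translate` are illustrative, not part of any library.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino-acid string used for
# the standard code; table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence in frame 1; whitespace in the input is ignored."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence, copied as printed.
print(translate("ATGACAGGTC AAATGAGCAA AAAAGAGCTG"))  # MTGQMSKKEL
print(translate("TTCGATGACC GGCAGTAA"))               # FDDRQ
```

The two fragments reproduce the first ten and last five residues of the protein sequence shown above; translating the full 1428 bp sequence the same way should yield all 475 residues.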