Gene Dole_0896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0896 
Symbol 
ID5693731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1047736 
End bp1048965 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content58% 
IMG OID641263493 
ProductHipA domain-containing protein 
Protein accessionYP_001528783 
Protein GI158520913 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCGGC TTGATGTCTG GCTGACGGTG CCATCAGGCA AAAGTATCAA AGCGGGCAGC 
CTGGTTGTCG CTGATCCTGA CACTGCCGGG GGAGGGCGGT TACGGGGGCA GTTCCGCTAC
AATCCCGAAT ACCTGGAACG CCCCGAGGCC TTTGCGCTTG ATCCCCTGAA CCTGCCTTTG
TCCGCGGAAA TCTTTGATGC AAATCGCCCC CGGGCCGGTG TCCATGGCGT TTTTGAAGAC
AGCCTGCCGG ATGACTGGGG CCGGCGGCTC ATGATCCGCC GTTACAATCT GAAACGTGAT
GAACAGCGTG TCCCCAATCT CTTACGGCTG CTGGGCGGCA AGGGGTTGGG GGCGTTGGGC
TACGCAGAAG AGGGCTCACC CGGGCCGGAA ACAACAAGTG TCTCCAGCCG GTACTTGCAG
GAACTGGCCT TGCTTGCGGA AAAATTCGAG CAGGACCCCG CTGCCGCAAC CGACGATGAA
TTCTCCCTTC TGTTCCAGGC CGGCAGTTCG CCCGGCGGGG CCAGGCCCAA GGTCCTTGTC
GCGGATGAAA ACGGCTCATA TCTGGCAAAA TTTGCCAGCG CCGGGGATCG ACTGGACGTG
GTCAGCCTGG AGGCGGCAGC CATGGAACTG GCCCGCCGGG CCGGAATAGA CACGGCCGGG
ACCAGGCTTG TGCCCCTGGG CACAACCGGA AAATGCCTGC TGGTGAAGCG GTTCGATATC
AACGCGGCAG GCGGTCGCAA TCACCTGGTC AGCATGCAGA CACTGCTCAG GGCCGATGAC
TATTATAACG CCGGTTACCG CGACCTGGCC GAAGTTATAC GGCACATTTC ATCACAACCC
GCCCATGATC TTCACCGGCT ATACCGGCAG ATGGTATTCA ACGTGCTGAT CGGCAACACC
GATGATCATC TCAAAAACTT TCTCATGCTG CATGATGAAA CCGGCTGGCG CTTGAGCCCC
GCCTTTGACT TGATACCGAA TATCGGTTTC AACCGGGAAC ATGTGCTGCG AATCGGCCTG
GATTCAAGAC CGCCGGATTT TGAAACCCTG CTGGCCGAGT CAAAGTACTT TGGAATCAAG
CGGCGGCAGG AAGCACGAAA TACAATTATG GAAATTAATG CAGCCGTCAT GGAATGGCCC
GGCATTTTCA AGAAATGCCA TGTGCCGGCA AGGGACGCGG ATAGCATCGG GAAGGATATC
ATGCGGCGTA CTTGTCGTAC GCCGCAGTGA
 
Protein sequence
MIRLDVWLTV PSGKSIKAGS LVVADPDTAG GGRLRGQFRY NPEYLERPEA FALDPLNLPL 
SAEIFDANRP RAGVHGVFED SLPDDWGRRL MIRRYNLKRD EQRVPNLLRL LGGKGLGALG
YAEEGSPGPE TTSVSSRYLQ ELALLAEKFE QDPAAATDDE FSLLFQAGSS PGGARPKVLV
ADENGSYLAK FASAGDRLDV VSLEAAAMEL ARRAGIDTAG TRLVPLGTTG KCLLVKRFDI
NAAGGRNHLV SMQTLLRADD YYNAGYRDLA EVIRHISSQP AHDLHRLYRQ MVFNVLIGNT
DDHLKNFLML HDETGWRLSP AFDLIPNIGF NREHVLRIGL DSRPPDFETL LAESKYFGIK
RRQEARNTIM EINAAVMEWP GIFKKCHVPA RDADSIGKDI MRRTCRTPQ