Gene Dole_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0472 
Symbol 
ID5693293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp536892 
End bp537962 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content60% 
IMG OID641263055 
Productpeptide chain release factor 1 
Protein accessionYP_001528359 
Protein GI158520489 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0216] Protein chain release factor A 
TIGRFAM ID[TIGR00019] peptide chain release factor 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000257905 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGACA GACTTCAAGA TGTGGAGCGG CGCTTTGGCG AGCTGGAGGG GCTCATGGCC 
GATCCCTCCA TCGTCAACGA CCGGGAAGCC TACCAGAAGC ACAGCCGGGA GCACGCCGAA
CTTGCCGACA TCGTTGATGC TTACCGGCGG TACAAGGCGG TCAGCGACGA GATCGAAAAG
AGCCGGGACC TCCTGGAAGA CCGGGACCCG GAGATCCGGG AACTTGCCAA AGAAGAGATT
TCCCGCCTGA AAGACGAACG GGAGCGGCTT GACGGCGAGT TGCAGAACCT GCTTACCCCC
AAGGACCCCA ACGACAACAA GAACGTGCTG GTGGAAATCC GGGCCGGAAC CGGCGGGGAA
GAGGCATCTC TGTTTGCCCA CGATCTGTTC CGCATGTACT GGCGGTACGC CGAAACCATG
GGCTGGAAGA CCGAGATCAT GAGCAGCAGC GTGACCGGCT CCGGCGGGTT CAAGGAAGTC
ATCTTCATGG TCTACGGCAA GGGGGCTTAC AGCCATCTCA AGTTTGAAAG CGGTATTCAC
CGGGTGCAGC GGGTACCCGA GACCGAGGCA CAGGGCCGGA TTCATACCTC GGCGGTTACC
GTGGCGGTGC TGGCCGAAGC CGAAGAGGTG GAGCTGCACA TCGACCCGTC TGAGATCAAG
ACCGACGTGT TTCGTTCCAG CGGACCGGGT GGACAGTCGG TCAATACCAC CGACTCGGCG
GTGCGCCTCA CGCATCTGCC CACCGGCGTG GTGGTTATCT GCCAGGATGA AAAGTCCCAG
TTGAAAAACA AAAACAAGGC CATGAAAGTG CTGCGGGCCC GTCTTCTGGA CCGGATGATT
CAGGACCAGA ACGAAAAGAT CGCCCAGAAC CGCAAAGACC AGGTGGGCAG CGGCGACCGG
TCCGGCCGCA TTCGTACCTA CAATTTTCCC CAGGGCCGGG TGACCGACCA CCGGGCCGGC
ATTACCCTTT ATAAACTGGA GAGCGTTCTT CAGGGAGATC TCTCCGAACT GGTTAACGGC
CTCGCCACCT ATTTTCAGGC CGAGCGGCTC AAGCAGGCCG ATGCCTCCTG A
 
Protein sequence
MFDRLQDVER RFGELEGLMA DPSIVNDREA YQKHSREHAE LADIVDAYRR YKAVSDEIEK 
SRDLLEDRDP EIRELAKEEI SRLKDERERL DGELQNLLTP KDPNDNKNVL VEIRAGTGGE
EASLFAHDLF RMYWRYAETM GWKTEIMSSS VTGSGGFKEV IFMVYGKGAY SHLKFESGIH
RVQRVPETEA QGRIHTSAVT VAVLAEAEEV ELHIDPSEIK TDVFRSSGPG GQSVNTTDSA
VRLTHLPTGV VVICQDEKSQ LKNKNKAMKV LRARLLDRMI QDQNEKIAQN RKDQVGSGDR
SGRIRTYNFP QGRVTDHRAG ITLYKLESVL QGDLSELVNG LATYFQAERL KQADAS