Gene Dole_2215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2215 
Symbol 
ID5695061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2687919 
End bp2689103 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content63% 
IMG OID641264819 
Producthypothetical protein 
Protein accessionYP_001530096 
Protein GI158522226 
COG category[S] Function unknown 
COG ID[COG1641] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00299] conserved hypothetical protein TIGR00299 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00110232 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATCT ATTTTGACTG CGGTTCCGGC ATCAGCGGCG ACATGACCCT GGCCGCGTTT 
GTGGACCTGG GCGTGCCTGT GGAGTGGCTC AGGCAGGAGC TGGCCCGCCT GCCCTTGGAC
GGGTTTGACA TTCAGGAATC CGTTGTGTCG CGTAACGGGA TTCGGGCGAA AAAAATCGAC
GTAATCGACA TTCATGGCGA TGGGGATGAC GGCCACGCCC ATGGCCGCCA CTACGCGGCC
ATCCGTTCCC TGATCGGCGG CAGCCCCCTG CCCGAGCGGG TCAAGCAGAT CGCCCTGGCC
GCCTTTGAAA AGCTGGCAAA GGCCGAAGCC GCTGTTCACG GCGTGCCTCT GGAAAAGGTT
CACTTTCACG AAGTGGGCGG GGTGGATGCC ATCGTGGACA TCGTGGGCGC GGCCCTGTGC
GTGGCCCACC TGAAGGTCGG AAAAATCGCC GCCTCAGCAG TGCCCCTGGG CTCCGGCCAT
GTGACGTGCA GGCACGGGGT GCTGCCGGTG CCGGCACCGG CCACGGTGGG AATCTTAAAA
GGCGTGCCGG TGTACGGCAC GAACGTGGAT TGTGAACTGG TGACCCCCAC AGGCGCGGCC
ATTCTTACCA CACTGGCAAC CGAATTTGGC CCCATGCCGG CCATGACCAT GGAAAAGACC
GGTTACGGCG CCGGTACCCG GGAGATTGCG GCCATGCCCA ACCTGCTGCG CCTCATTTCC
GGCAGATTTC AGACTGCCGG TGCGGCAGAC GGCCATGCCG ACCTGATGAT GGAGACCTGC
ATTGACGATA TGACCCCCGA AATCTGGGGC CATGTCATGG AACGGCTCTT TGAAGCCGGG
GCCAGGGATG TTTATTTTGT CCCGGTGCAC ATGAAAAAGA ACCGGCCCGG TATCCTGCTT
TGCGTACTGT GCGACAAGGC CCAACGCGAG GCCCTGGCCG CCTGCATCCT GTCTGAAACC
ACCTCCATCG GCGTGCGCTA CTATCCGGTG GAGCGCATCA TGCTGGCCCG CCGGCCGGTC
ACGGTGAAAA CGGCCTTTGG CCCGGTCCAG GCCAAGGCCG TCACCCTGCC CAACGGCACC
ACCCGCATCG CCCCGGAGTA TGAAGCCTGC CGCAAGATCT CCCTGGAACG CAAGGTTTCA
ATCCTGGAGA TTTACCGGGC GACGGCGGAT GGCCCGGTAG TTTAG
 
Protein sequence
MDIYFDCGSG ISGDMTLAAF VDLGVPVEWL RQELARLPLD GFDIQESVVS RNGIRAKKID 
VIDIHGDGDD GHAHGRHYAA IRSLIGGSPL PERVKQIALA AFEKLAKAEA AVHGVPLEKV
HFHEVGGVDA IVDIVGAALC VAHLKVGKIA ASAVPLGSGH VTCRHGVLPV PAPATVGILK
GVPVYGTNVD CELVTPTGAA ILTTLATEFG PMPAMTMEKT GYGAGTREIA AMPNLLRLIS
GRFQTAGAAD GHADLMMETC IDDMTPEIWG HVMERLFEAG ARDVYFVPVH MKKNRPGILL
CVLCDKAQRE ALAACILSET TSIGVRYYPV ERIMLARRPV TVKTAFGPVQ AKAVTLPNGT
TRIAPEYEAC RKISLERKVS ILEIYRATAD GPVV