Gene Dole_1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1659 
Symbol 
ID5694496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1972700 
End bp1973959 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content52% 
IMG OID641264254 
Producthypothetical protein 
Protein accessionYP_001529540 
Protein GI158521670 
COG category[S] Function unknown 
COG ID[COG5338] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000022104 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACAGC TGTTTACGGC AGGAACGTGC CTCTTTCTGT TGCTGATCTA TGGGGCTACA 
GTACATTTAC CGGCCCATGC CGCCTCCGGC ACCATCGGCG GAGATATTTT CGGTCTTCAG
CAGGGAGTCT TTCACACCTT CCTTAACGTC ACCGAAAAAT ACACGGACAA CCTTTACAAT
TCCAGAACCA GCAAAGAATC AGAGCTGATC AGTATTGTTT CCCCTGGCCT GGCCCTGGCC
CTTCCCGGCT CCGACGTGGT GGATATTAAA ATGAACACCG CCACCGGCGC CCCCGGCGGT
CTTGCCATGT CCCGCTATAA AATGGACACC ACCCGGCGCT ACCAGGGCAT GCTGGTTTAC
AACCCGGAAT TTGAGTTCTA TCATGACAAT TCCAGTGAAA ATTTTATCAG CCACAAGGTC
AAGGGAGCAT TTCAATATAA AGCCCCCGGC GGGCTGACCT TTGATGTCGC CGATCATTAT
AAATACGGAC AGGAGATGCG GGGCGAAATC GGCAACCCGG ACCCGGACAC CTATTATTCC
AACGTGGCCC ACGCCATTGT GGAATTTGCC TTTTCCCCGA AATTCAGCAT CGGTGCCGGC
GGTGCGTCCC ACACCATTTC CTACCGGGAA ACAAACTTCC GTGACAGAAA CGACAGGGTC
TACTTCGGCT CCCTCAACTT CCATCCAACG GCAAAAACAC GGCTTTTCTT CGAATACAAA
AACATTGACG TTCGCTACGA TGCCTTTCTC TCCACGGACA AAGAAAATAC AGAAGATCAA
TATTATGCAG GTTTTGCATG GAAGATGACG GCCAAATCAC AGGGTACGTT GAAGGTCGGT
TATATGGCCA AGGATTTTGA TACCCCCGGA ATAGACGACC CCTCGGACTG GGCCGGTGAA
ATCGACCTGA CGCATGCAAT CACCCCGGAC ACGACCATCA TGCTGGGGGC TTCCCGAAAA
TACCACGAAA CCAACATTGC GGCCGCCGAC TACTACACCG CCGATCGCGT CACAGCCATG
TACAGCCAGG CATTCACCCC CAAGCTCAAG GGTGACATGA TGCTCTCTTA CGGCAAAGAC
AACTATGAAG GTATTATCCT TGAGTGCGAC ACCTACATAA TCCGGCCCGC CCTTACCTTC
AAGCCCCGGC GGTGGCTGTC GATTGAACTG GCCTACTCTT ACACTGAGCG TTTTGCCGAC
CTGGCCTCCA TGGACTACAG CACCAACGAT TACACACTGA GGATAGGGGG CACTTTTTAA
 
Protein sequence
MRQLFTAGTC LFLLLIYGAT VHLPAHAASG TIGGDIFGLQ QGVFHTFLNV TEKYTDNLYN 
SRTSKESELI SIVSPGLALA LPGSDVVDIK MNTATGAPGG LAMSRYKMDT TRRYQGMLVY
NPEFEFYHDN SSENFISHKV KGAFQYKAPG GLTFDVADHY KYGQEMRGEI GNPDPDTYYS
NVAHAIVEFA FSPKFSIGAG GASHTISYRE TNFRDRNDRV YFGSLNFHPT AKTRLFFEYK
NIDVRYDAFL STDKENTEDQ YYAGFAWKMT AKSQGTLKVG YMAKDFDTPG IDDPSDWAGE
IDLTHAITPD TTIMLGASRK YHETNIAAAD YYTADRVTAM YSQAFTPKLK GDMMLSYGKD
NYEGIILECD TYIIRPALTF KPRRWLSIEL AYSYTERFAD LASMDYSTND YTLRIGGTF