Gene Dole_1859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1859 
Symbol 
ID5694699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2249026 
End bp2251086 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content61% 
IMG OID641264457 
ProductTonB-dependent receptor plug 
Protein accessionYP_001529740 
Protein GI158521870 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000014407 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTG CAATTCGTAC CCTGCTTTTG ACGGGTTGTA TAATTTACTG TGGTGCCGCC 
GTGGCTGTTC AGGCGGATGA ACAAAAGGCC CAGGCCACGG CCATGCTGGA TGAGATCGTG
GTGACGGCCA CCAAGACCGA GGAGACCCGG AAAGATGTAC CGAATGCCGT AATCGTTATC
GACCAGGCCG CTATTGAAGC CTCCACCGCC GACACGGTGG GGGAGCTGCT GGCCAATGAG
CCGGGCGTTG ATTTCCGGAC CCGGGGCGAC TACGGCGGCG CGGCCCAGTC CCTTAACATC
CGGGGCATGA GCGACACAGA GGTCCAGGTC ATGGTCAACG GCGTTTCCGC CAACTCCCCT
TCCCTGGGAT CGGCGGATAT CGGCACCATT CCTTTGAGCA GCATCGAGAG AATCGAGATC
GTAAAGGGAT CAGGCTCCAT GCTGCACGGT TCCGGGGCCA TGGCCGGGGC GGTCAACATT
ATCACCAAGC GGCCGAAACG GGATCGCATG GACGCAAAGG CCTCGGCCGG TTACGGCACG
GACAACACCT ACCAGCTTGC CGCCGAGCAC GGGCAGTACA TCGGCGACTT CGGCTACTAT
CTTACTGCCG GGCAGCAGGA GACCGACGGC CGGCGGGACA ACGGCGACAT GGAGCAGCAG
GACGCCTCCC TGGCCCTGGT TTTAGACAGG GATGACCTTC TGAATGTCAC CCTGAACGGC
AGCGTGGTGG ACCGGGAGTT CGGCGTGCCG GGCGTCAAGC CGCCGGCCGG CACCGGCACC
CATTACGTGG GCGGCGTGGC GTTTTACAAC GCGGATTCGG CCAGCCTGGT GGATAACGGC
AGCGACACCA CCTACCAGGC ATCCCTGGAG ATTAAAAGCC GGCCGACCGA CTGGCTGGCC
GTCAACCTGA AGCCTTTTTA CAGTGACCTG GAGAACTACC ACTCTACCCG GTACAACGAC
ACCTCCTGGG GCCACCTGGC CGGCGAGGGG TACAAAACCT GGGTTTACAA CACGGTAAAA
GGAATTGACG GTCATGTGGC CCTGGATCCG GTCGACGGCC TCACCCTGCT GCTGGGCGGT
GACTACAAAG ATTATGAGTG GGAGACCGAG CAGTCTTTTC TGGATGTCAG CGGCGCGTTC
AACCCTGCCA TTCCCGTGGC CACCAACGAT GCCAAGCTCT TTACCAAAAG CTGTTTCGGA
GAGGTGCAGT ACCGGCCCAG CCAGTATGTC AAGTTGCTGG CCGGTGTGCG GGAGGAGAAC
CACTCCACCT TTGGCCGGGA AACCCTGCCC CGCTATGGCC TGGTGTTCAA CCCCACCGGC
AGCACCGCGG TCAAGTTCAG CCACGGCAAG CATTTCAAGG CTCCCACGCC CAATGACCTG
TTCTGGCCGG AAGATGATTT TACCCGGGGC AACCCCACCC TCAAACCCCA GACCGGGTGG
CACACCGACG TGACCATTGA ACAGGGCCTT TGCCAAAACG CCCTGCTGGT GACTGCCTCG
GCCTTTACCT GGGACATTGA CGACAAGATC GACTGGGCAC CCAACCCGGC ATTTCCCGGC
CCCTATGGTG ACAAGTGGAC CCCCACCAAC GTGAACTCCA GCCGGGGACA CGGCTGGGAG
GCGGGCCTTC GGATTCAGCC GGAGGAGCAC TGGGCCGCCG ACATCAGCTA CACCTACACC
TCGGCCACAG ACACGCTTCA GTTTGTGGAG CGGACGGCCC AGTACCTGGC AAACCACCGG
GCCAAAATCG GCGGGTCTTA CCGGTTCGGC TTCGGCCTGA CCACGGCCCT GACCTGCCGG
TACGTGGGCA CCCGGGATTT TTACCGCAGC AGCTACGACA GCCTGCCCAC CGACCGGCTT
GACTCCTATA TCACGGTGGA CCTGAAAGCC GAGCAGCGGC TGGCCGGCCA CTGGATTCTC
ACCCTGCGGG CCGACAACCT GATCGACGAG GAATATGACA CCTATGTGGG CACCTTTACC
GACAGCGCCG GCGCCATGCA GTACGGCCGG TTCCCCGGCG GCGGCAGCTC ATATTTTGCC
AGCGTTGGAT ATGAATATTA A
 
Protein sequence
MKLAIRTLLL TGCIIYCGAA VAVQADEQKA QATAMLDEIV VTATKTEETR KDVPNAVIVI 
DQAAIEASTA DTVGELLANE PGVDFRTRGD YGGAAQSLNI RGMSDTEVQV MVNGVSANSP
SLGSADIGTI PLSSIERIEI VKGSGSMLHG SGAMAGAVNI ITKRPKRDRM DAKASAGYGT
DNTYQLAAEH GQYIGDFGYY LTAGQQETDG RRDNGDMEQQ DASLALVLDR DDLLNVTLNG
SVVDREFGVP GVKPPAGTGT HYVGGVAFYN ADSASLVDNG SDTTYQASLE IKSRPTDWLA
VNLKPFYSDL ENYHSTRYND TSWGHLAGEG YKTWVYNTVK GIDGHVALDP VDGLTLLLGG
DYKDYEWETE QSFLDVSGAF NPAIPVATND AKLFTKSCFG EVQYRPSQYV KLLAGVREEN
HSTFGRETLP RYGLVFNPTG STAVKFSHGK HFKAPTPNDL FWPEDDFTRG NPTLKPQTGW
HTDVTIEQGL CQNALLVTAS AFTWDIDDKI DWAPNPAFPG PYGDKWTPTN VNSSRGHGWE
AGLRIQPEEH WAADISYTYT SATDTLQFVE RTAQYLANHR AKIGGSYRFG FGLTTALTCR
YVGTRDFYRS SYDSLPTDRL DSYITVDLKA EQRLAGHWIL TLRADNLIDE EYDTYVGTFT
DSAGAMQYGR FPGGGSSYFA SVGYEY