Gene Dole_1788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1788 
Symbol 
ID5694628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2158421 
End bp2161699 
Gene Length3279 bp 
Protein Length1092 aa 
Translation table11 
GC content54% 
IMG OID641264386 
Productpolymorphic membrane protein 
Protein accessionYP_001529669 
Protein GI158521799 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCGC TTAAATGGAA AGCAGCACCC TTCTTTTTTG GTCTAAGTGT TTTTGCATTG 
CTGTGCCTGT CACCGTTATC CGCATTTTCT CAAAACATCA ACATTGATCC GTGTGTTGCC
ATACTGGGCC TTGATGGTGG CGAGTGCGGT GGATGGGCGA TGGATAACAG CTTTGACTGT
ATGTATGAAG AAAATTCTAT TCTCCGTGTC ATCAAATATC TGGATAACAA ATCATTTCAA
TATCGAAGCG TTTTGGAGTT CAACTTGCCG ACGGAGGTAA TGGATTCAAC GATCAGCTCT
GCTACCCTCT TTTTGCCGAA CGACAGTTCC TACGGCTACC TGTACGATCG GGTGATTTTA
AATGGATATA ACGCGGACTG CACCGCCCGG CTTGATGATT TTGAGAATAC AGATAATGCT
ATTGAATATT TTCCTGCCCA TACCAGTGGG ACGGACTATT ATATAGATGT AACTGAATTT
ATAAGCACAA TGCAGGGAAC TACGGGAACG ATAGGGTTTA ACCTGATTTC AAACGGTTCG
GATGGGTATT TAGATGTACG AATTGCCAAT ACTGCTGAAA GGCGTTCCCA GCTCCGGGTG
CAATATGCGG CAAAAATTGT TGTGGATTCG GATGCAACTG GCAACAATGA TGGCACATCA
TGGACAGACG CCTATGTCTC TCTGCAGGAC GCCCTCAACG CCGCCATGGC CGGCGACCAG
ATCTGGGTGG CGGCCGGCAC CTATTACCCG GATGAGGGCA GCAGCCAGAC AGACAACGAC
CGCGCGTCAA CATTTTTAAT CCCCAGCGGT GTGGCTGTTT ATGGGGGCTT CGACGGCACG
GACGGTGCCG GCGGCGGCGC CCTGGAAACC ACCCTTGACG AACGGGACTG GGAAAACAAT
CCAACCATTC TAAGCGGAGA CCTGGGCCAG GATGACGATT CAGGCGGCGA CAACAGCGAA
AATGCCTACC ATGTGGTCTG GTTTGACCGG GCCGGCAACC AGACCCGCCT GGATGGCGTT
ACCATAACCG CCGGCAACGC CGATGGGAGC CTCTGGAACG ACAAGAACGG AGGCGGAATT
TACAATGACG GTTCCGGCAG CGGAAACAGC AGCAATCCCT GCTTAAACAA CTGCACCATC
AGCGGGAACA ATGCTTTTAA TGCCGGCGGC GGTATATATA ATAACGGGTC TGACACCCTC
AGCCTTACCG GGGACAGCAG CCCCACCCTG ACCGACTGTG TCATCAGCGG GAATAATGCC
GGTTCCGGCG GCGGCATATA TAATTACGGC AGTTATGGCA ACAGCAGCCC GGTATTGACC
AACTGCACGA TCAACGAAAA CAACGCGTAC AACGGCGGTG GCATCTACAA CAACAGCAGC
AACGGCAACA GCAGCCCCAT CCTTTTGAAC TGTATCATCA ACGGTAATAC CGCTGAAATG
GGGGGCGGGC TCCATAACGA AAGTGACGAT ACCGGAACCT GCATGCCTGA GCTGACCGGT
TGTACCATCA CCAACAACAC CGCTCTCTAT GAAGGCGGTG GAATGGATAA CTATGCACAC
AATGGAACCA CAAGCCCGAT TCTCACCGAC TGCACCATCA GTGACAATTC TGCCGATACC
GGCGGCGGAT TCTCCAATAA CGCTTTAGAG CATGGAATAT GCAGCCCCAC CCTGTATAAC
TGCGTCATCA GCGGCAATGC CGCTACGGGT TTTGGGGGCG GAATGTATAA TTATAACAAC
AGCGGAGAGA CCTGCCCTGC CCTGACAAAC TGCACCATCA CCAACAACAC CGCAACCTAT
GAAGGCGGTG GCATCGGTAA TTATGCGCAT AATAATGGGG CATCAAGCAG CCCTGCGCTG
ACCAACTGTG TTATCCGTGG CAACACGGCT GAAGTGGGTG GTGGCATCAG TAACTACGCG
GATAATGAGG GCACATCAAG CCCTGTCCTT ACCAACTGCA CCATCAGTGG CAATGCCGCC
GACAACGGCG GCGGTATTTT CAATGGCGGG TTCGAAGGCA TAATCAGCCC GGCCTTAACC
AACTGCATCC TCTGGAACAA CCATGCGGAT ATATCCGGCC CTGAAATATA TAATGATGGT
GGTTCTCCCG CTTACGCCTA CTGCAACATC AAGGGCAGCG GCGGCAGCGC GGCCTGGAAC
ACCACCCTGG GCACGGACAG CGGCAACAAC ATAGACGCCG ATCCCATATT TGCAAACCCG
GCCGCCGACC TGCGCCTGAT GGCCGGTTCG CCCTGCCTGG ACACCGGCAA CAATGTTGCC
AACGGCACGT CGTTTGACCT GGATGGCGAA GCCCGTATCC AGAACACCAT CATTGACATG
GGGGCCTACG AAGGCGCGGA AGCCCTGCCC TTTGCCAACC TGGTCAACTG GAACGGCAAC
CTGGTGGCCG ACTTTGGCGA CAACGGCCTG TGGTACCACA ACGGCAGCAA CTGGAACTGG
ATGACCAACC GGGGCCATGT CAACCAGATG GTGGTGTGGG ATGGCAAGCT GGTGGTGGAT
TTTGGTTCCG ACTACGGAGT GCACTACTAT GACGGCACCG GCTGGACCTG GATGTCCAAC
AAAGGCGGTG TGGCAAAAAT GATCACCTGG AACAATGGCG CAACAGAAAA GCTGGTGGTG
GACTTTGGCG CAGGCAAGCG CGTCTACACA TATAACGGTT CCTGGAGCTG GTTTACCAAC
AAGGACGCCG TGGCAAACAT GACCGTGTGG GATAACCGGC TGGTGGTCGA CTTCGGGTCC
GGCCGGGGCG TGTACAACAA CAACGGCACC TGGAACTGGA TGACCAACAA GGACGACATT
GCCAAAATGG TGGCCTGGAA CAACGGTTCT GCCGAGCGGC TGGTGGTGGA CTTTGGCGGG
GGCCGCCGGG TGTACACCTA CAACGGCGCC TGGGCATGGC TCACCAACAA GGATGACGTG
GCCGACATGA CCGTGTGGAA CAACAAGCTG GTTGTCGATT TCGGTAACGG CCGGAGCGTG
TACAACTATG ATGGTGCATG GCACTGGATG ACCAACAAGG ATAACGTGAC CAAAATGGTG
ACCTGGAAGG ACGGTGCCGA CAAACTGGCC GTAGACTTCG GCTCAGGCCG GGGGATGTTC
TATTATGACG GTGTATGGCA CTGGATGTCC AACAAGGATG ATCTTACGGA TATGATCGCC
TGGGGCACCC GCCTGGCCGT GGATTTTGGC TCAGGCCGGG GGATGTACAA CTACGACGGT
GCCTGGCACT GGATGAAAAA CTGGAGTACG GCAGACTGA
 
Protein sequence
MTALKWKAAP FFFGLSVFAL LCLSPLSAFS QNINIDPCVA ILGLDGGECG GWAMDNSFDC 
MYEENSILRV IKYLDNKSFQ YRSVLEFNLP TEVMDSTISS ATLFLPNDSS YGYLYDRVIL
NGYNADCTAR LDDFENTDNA IEYFPAHTSG TDYYIDVTEF ISTMQGTTGT IGFNLISNGS
DGYLDVRIAN TAERRSQLRV QYAAKIVVDS DATGNNDGTS WTDAYVSLQD ALNAAMAGDQ
IWVAAGTYYP DEGSSQTDND RASTFLIPSG VAVYGGFDGT DGAGGGALET TLDERDWENN
PTILSGDLGQ DDDSGGDNSE NAYHVVWFDR AGNQTRLDGV TITAGNADGS LWNDKNGGGI
YNDGSGSGNS SNPCLNNCTI SGNNAFNAGG GIYNNGSDTL SLTGDSSPTL TDCVISGNNA
GSGGGIYNYG SYGNSSPVLT NCTINENNAY NGGGIYNNSS NGNSSPILLN CIINGNTAEM
GGGLHNESDD TGTCMPELTG CTITNNTALY EGGGMDNYAH NGTTSPILTD CTISDNSADT
GGGFSNNALE HGICSPTLYN CVISGNAATG FGGGMYNYNN SGETCPALTN CTITNNTATY
EGGGIGNYAH NNGASSSPAL TNCVIRGNTA EVGGGISNYA DNEGTSSPVL TNCTISGNAA
DNGGGIFNGG FEGIISPALT NCILWNNHAD ISGPEIYNDG GSPAYAYCNI KGSGGSAAWN
TTLGTDSGNN IDADPIFANP AADLRLMAGS PCLDTGNNVA NGTSFDLDGE ARIQNTIIDM
GAYEGAEALP FANLVNWNGN LVADFGDNGL WYHNGSNWNW MTNRGHVNQM VVWDGKLVVD
FGSDYGVHYY DGTGWTWMSN KGGVAKMITW NNGATEKLVV DFGAGKRVYT YNGSWSWFTN
KDAVANMTVW DNRLVVDFGS GRGVYNNNGT WNWMTNKDDI AKMVAWNNGS AERLVVDFGG
GRRVYTYNGA WAWLTNKDDV ADMTVWNNKL VVDFGNGRSV YNYDGAWHWM TNKDNVTKMV
TWKDGADKLA VDFGSGRGMF YYDGVWHWMS NKDDLTDMIA WGTRLAVDFG SGRGMYNYDG
AWHWMKNWST AD