Gene Information |
Locus tag | Dole_2844 |
Symbol | |
ID | 5695702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 3425190 |
End bp | 3427931 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641265459 |
Product | surface antigen (D15) |
Protein accession | YP_001530724 |
Protein GI | 158522854 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4775] Outer membrane protein/protective antigen OMA87 |
TIGRFAM ID | [TIGR03303] outer membrane protein assembly complex, YaeT protein |
|
|
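The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the convention that agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3425190
END_BP = 3427931
GENE_LENGTH_BP = 2742
PROTEIN_LENGTH_AA = 913


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values listed in the record.
coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)
```

Under these assumptions, 3427931 - 3425190 + 1 gives the 2742 bp gene length, and 2742 / 3 - 1 gives the 913 aa protein length.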
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGATCAA GACAGATATG GTATCGTTGT GTCCGGTGGT TTGCGGACAG TCCCGTTTCC CGGGCCATGT TTTATAACGG CGCCGTTTTC CTGCTGGCGC TCTGTTTCTT CCCTGTCCCC GCCGGGGCAG ACCAGCAGCC TCGTGTGGCG CTTTTCCCTT TGGCCGTGTG GGCCGACAGC GACGCCGCCG GTCCGCTGTC ACAACAGGTA CTGGCCCTGG TTTCCGAAAA TTTTGAACAG ACCTGCGGTG TTTCCGTTAC GCTTTACGAT GAAGCAGCGT CCCTTTTCAC GCCGGGGGAG AAGCAGCGGG CCGCCCGGGC GGCACAGGCC GATTTCGCGC TATGGGGCGG CATCACCTCC CTGGGCAACA GCTTCAGCAT TGATCTTTCC CTGCTTTCCG CGGACAGCGG GGAACTTCGC CGGTTTTTTG GCCAGGGGGA AGATCCGGGC CAACTGGCCG ATGCGATTCG CGGTCTTTCC AGTGACATGT GCGCCGCGGT CACGGGTCGC AAAAGGATCG AGCAGATTGT GGTCCAGGGC AGTGATCGTA TTGAGGCCGA TGCCATTGTG CGGGTGATTC GGACCAGGGA AGGGGAAATA TACAACCGGG AACAGCTCTC GGAAGACCTG AAGCACATTT ACGCCATGGG CTATTTTGAG GACGTGCGGG TGGAAGCCGA AGATTCGGCC CGGGGGAAAA AGATCGTTTT TACCGTCAGG GAGCGGCCCA CGGTCGGCCG TATCACCATC CACAAGTACA CGGTTTTTTC CGAAGAGGAG ATCAGGGAGG AGCTTGTCAT CAAGCGTGGC GCGGTGCTGA ATATCGTTGA GATTCAGGAG AGTGTCAACC GCATTGAGCA GCTTTACAAG GGCAAGAACT ATCACAACGT AAAAGTCTCC TACGAGATTC AGGAGGAAAA GAACAACCAG GCCGATGTGG CCTTTACCAT TGACCGGGGC GACAAGTTCA TGGTGCGCGA GATTATTTTT GCGGGCAACA GCGCATACAG GGACAAGGAG CTGCGCAAAA TCATCAAAAC CAGAAAAAAA GGATTCTTCT GGTGGCTCAC ATCGTCGGGG GATCTGAACA TGGATCAGCT GCGCCAGGAC GTGGCCATGC TGAGCAACCA TTACCACAAC AGCGGCTATA TTGATGCCCG GATCGGAGAC CCTGAAATCG AATACCTGGA GGAACGGATT CGCATCACCT TCAAGATTGA GGAGGGACAG CGATTCCGGG TGGGAATCGT GGACCTTCAG GGGGACGCGG ATGAGTTCAG GCCCGAGCTT GAGAAAATGC TCAACATTAC CGAAGAGGAA TTTTTCAACC GGTCCATGGT CCGGGCCGAT ATTCTTTCGA TCACTGGTTT TTACGGGGAC AAGGGGTATT TCTATGCCGA CGTGATTCCC GATGTGCGGA AAAATGATGC CGACCTGACG GTGGATATCA CGTATATCGT TCAAAAGGGC GGGCTGGTCT ATTTTGACGA TATTATCATC ACCGGCAACA CCAAGACCCG GGACAAGGTG ATCCGTCGGG AGCTGGATGT CTATGAGCAG GAGCTTTACA GCGGCAGCCG TTTGAAAAAG AGCGTGTCCC GGCTTCACCG GCTTAACTAT TTTGAGACCC TCAAAGTAGA CACCGTTGAA GGCGCGGAAC CGGACAAGGT GGACCTGAAG ATTGAGGTGG AGGAAAAACC CACGGGCATG TTCTCCTTCG GCGGCGGGTA CAGCAGCGTG GAGAGCCTGT TTTTTACGGC CTCCGTCTCC CAGCAGAACC TGTTCGGCAG GGGCCAGGTG 
CTCAACCTTC AGGGCCAGAT CGGCGGCACC TCCTCCGAGT ACCGGCTCAG TTTTACCGAG CCCTATCTGT TTGACACCCG TGTTTCTGCA GGCATCGATG TGTATGACTG GAACGTGGAT GCCGATACCT ATGACCGGCA TACCATCGGC GGCAGCCTTC GGTTCGGTTA TCCCCTGTTT GAAAACACCC GCCTTTACCT GGCGTATACC TATGATGTCA ATGAGGTGGA CGATGTTTCC ATCTATGCGC CCTGGTCCAT TCAGGAGATG GCCGCCCAGG GAGGCGAGAG TGTCACCAGC GGGGCCTCTG TTTCCCTGGT GTATGATACC CGTGATAACT ATATGAATCC CTCCCGGGGC ACCAAGAGTA CGATTGCCAT TGAGAACGCC GGCGGTCCTC TGGGCGGAGA TGTGGCGTTC ACCAAGTACA CCGGTGAGAC CGGCTGGTAT CACCCTCTTT TCTGGCGGTT TGTGGGTTTT GCCCATGCCA AGGGCGGTTA TGTGCATGAA AATTCCGGCG GGTTTCTGCC GGACTATGAC CGGTTCTACC TGGGCGGCAT CAACTCGTTG CGGGGGTTTG GCTGGCGTGA TATATCGGTG AAAGAGACCG TAGAGGTATG GTCCAGTCAA ACCAACAGCT GGCAGGAGGT GATCGTGGAA GAGAAGGGCG GCGACAAGTT TGTGCAGTTT AACATCGAAC TGCTCTGCCC CCTGTTTGAC AAAAAGGCCG GTCTGGTGGG AGTGTTATTT TATGATGCCG GTAACGTGTA TGATGAAGGC CAGGAGTTGT TTGACCTGCG CCCACGGGAG AGCGCCGGGT TCGGTATCCG GTGGTTCTCG CCCATGGGTC CCATTCGCCT GGAGCGGGGA TACATCCTGG ACCCCAGGCC GGGCGAGGAT TCTGGCGGAC GGTGGGAGTT CACCATCGGC ACGGCATTTT AA
|
Protein sequence | MRSRQIWYRC VRWFADSPVS RAMFYNGAVF LLALCFFPVP AGADQQPRVA LFPLAVWADS DAAGPLSQQV LALVSENFEQ TCGVSVTLYD EAASLFTPGE KQRAARAAQA DFALWGGITS LGNSFSIDLS LLSADSGELR RFFGQGEDPG QLADAIRGLS SDMCAAVTGR KRIEQIVVQG SDRIEADAIV RVIRTREGEI YNREQLSEDL KHIYAMGYFE DVRVEAEDSA RGKKIVFTVR ERPTVGRITI HKYTVFSEEE IREELVIKRG AVLNIVEIQE SVNRIEQLYK GKNYHNVKVS YEIQEEKNNQ ADVAFTIDRG DKFMVREIIF AGNSAYRDKE LRKIIKTRKK GFFWWLTSSG DLNMDQLRQD VAMLSNHYHN SGYIDARIGD PEIEYLEERI RITFKIEEGQ RFRVGIVDLQ GDADEFRPEL EKMLNITEEE FFNRSMVRAD ILSITGFYGD KGYFYADVIP DVRKNDADLT VDITYIVQKG GLVYFDDIII TGNTKTRDKV IRRELDVYEQ ELYSGSRLKK SVSRLHRLNY FETLKVDTVE GAEPDKVDLK IEVEEKPTGM FSFGGGYSSV ESLFFTASVS QQNLFGRGQV LNLQGQIGGT SSEYRLSFTE PYLFDTRVSA GIDVYDWNVD ADTYDRHTIG GSLRFGYPLF ENTRLYLAYT YDVNEVDDVS IYAPWSIQEM AAQGGESVTS GASVSLVYDT RDNYMNPSRG TKSTIAIENA GGPLGGDVAF TKYTGETGWY HPLFWRFVGF AHAKGGYVHE NSGGFLPDYD RFYLGGINSL RGFGWRDISV KETVEVWSSQ TNSWQEVIVE EKGGDKFVQF NIELLCPLFD KKAGLVGVLF YDAGNVYDEG QELFDLRPRE SAGFGIRWFS PMGPIRLERG YILDPRPGED SGGRWEFTIG TAF
|
| |
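The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The gene sequence begins with GTG while the protein begins with M; GTG is one of the alternative initiation codons in translation table 11 and is read as methionine at the start of a CDS. The sketch below is a minimal illustration with hypothetical helper names. For brevity it uses only the first two blocks and the final blocks of the gene sequence, not the full 2742 bp.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(displayed.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in an already-cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    return (sequence.count("G") + sequence.count("C")) / len(sequence)


# Excerpts copied from the gene sequence above (not the full gene).
head = clean_sequence("GTGAGATCAA GACAGATATG")
tail = clean_sequence("ACGGCATTTT AA")

# Stop codons of translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

start_codon = head[:3]
ends_with_stop = tail[-3:] in STOP_CODONS
print(start_codon, ends_with_stop)
```

Applying `gc_fraction` to the full cleaned gene sequence, rather than to these excerpts, is how the listed GC content could be compared against the sequence itself.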