Gene Information |
Locus tag | Dole_0329 |
Symbol | |
ID | 5693148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 376692 |
End bp | 379655 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641262910 |
Product | type IV pilus secretin PilQ |
Protein accession | YP_001528216 |
Protein GI | 158520346 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4796] Type II secretory pathway, component HofQ |
TIGRFAM ID | [TIGR02515] type IV pilus secretin (or competence protein) PilQ |
|
|
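The coordinate and length fields in the record above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon (consistent with "Is gene spliced | No" and the TAG at the end of the gene sequence). The function names are hypothetical and are not part of any database API.

```python
# Sketch: relate the coordinate fields of the record to its length fields.
# Function names are hypothetical; coordinates are assumed 1-based and inclusive.


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(376692, 379655)
print(length)                     # 2964
print(protein_length_aa(length))  # 987
```

Both values agree with the "Gene Length" and "Protein Length" fields listed above.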
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00519282 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAGGA TGAACGCAAA AAAAGCGGCA AAAGTTCGCG GTACTCTTTA TTTTCTGAGC GCCATTTTGA TGGTGGCGAT GATCGCGGGA TGCACTCCCT CCGCAGGGGT CCGGCAGGGC ACCGAGGCTA CGGCACCGGA AGCAACCTCT GACACCGGGC AATTGGCCAG ACGTATTACC GGCATATCAG TCAAGGATAC GCCGGATGCC GTGGTGGTCT CTATTCAGAC CAATCACCTG GCAGACTATA CGCTGGCGGA ACCGCCCCTG GAGCAGGCCG TGGTGCTCTA TTTTCCAGAG GCCGGCCTGG CCACGACCAC GGCCGATCCG GCGGAACCCA ACGATCTGGT GGGCACCGTT ACCGCTGCGG AGCTGATTGC CGGGGGGCCT TCAAAGGTCG TTATTCCCAT GCAGCAGCCC GGCCTGGCCT ACGAGGCGAT GCGGGGCCAG GACAACAGCC TGAATATTTT ATTTAACAAG ACGGCGGTTT CCCCGGCACC GGAAGCAGCC TGGGGACAGG AAGAGGCCGC GGAAACGGTT GCTGCGGCAG AAACAAAAGA GGATGCGTCT TCTGTTGTTG AGGACGCCAC ACCCGCGGCA CCCGCCACCC TGATGGAAGA CATCACGGTG ACCGGCGATG CCGATGCCCT GGATATAACG ATTCTGGCCG ACGGCGCCAT CACCGATCAT AAACTGAGGA TACTCAAGGC CCCGCCCCGT ATCGTTTACG ACCTGCCCGG CATTCGAAGC ACCCATGCGG GCGAACAGCG CATTGCCGTG GATTCGGCCA TTGCCGGCCG CGTGCGCCAT TTTGCCCACC CCGACTACCT GCGGGTGGTA GTAGACTTAA AGGATGACCT GTATCTTGGC AAAGCCAGGG CTTACAGCTT AAGCAACGGC TTGCTGATTC ACGTGGGGGA GAAAGAAACC CCGGCCCTGG CCGCGGCCCG AAAGACCGGA CCGGTGACCA CCGAAGCCCG GACCTCTGTT GCGGAGGCCG CTTCCGTTGA ACCTGTTGAC GCCTCCCCGG CGGTTGCACC GGCACCTGCG GAGCAAGAAG CCGAGCCGGC CCCCGCCGTG GCGGTCAAGC GCTCCGGCAA GCCGGCCATG GTCAACCGCA TCGACTTTAT GGAAGAGATG GACGGCCGGT CCGCCATTGA AATCGGCACC ACCCGGCCGG TGGATTATGA GATGCTGACC ATTTCCGGCA ACCAGCTCTT TCTGAAGCTG GACAACACCG ACATTCTCAG CTATCGTCAG CGGCCCCTTA TCACCACCCG GTTTGAAAGC GCGGTGGACC TGATTTTGCC GGTACAGACA AAAAAAATGA AAGAACAACG GTTTTCCGCG GTCAACATCG ACCTGCGGGA GGCGGTGCCC TACACCATAA AACAGATGGA TAACACCATC CGCATTCTGT TTGAACCCTC TTCCGTTGCC CCGAACCCGG CAAAAGAGGT GGTGATTCCC ACGGAACTGG CCATTGAGTC CCTGGTGACC GACACCGCCG GAGCGCCGGA TATCGAGACA GCGGTCACGG AACCGGCCCC GGCATCTGAA GCGCCGGTGA CGCCTCCGGA AACCGACGTA CCGGCCCCGA TTGGAACCGT TGCGGCGCCT GAAGCCGCGG TTCCCCAGAC CGCGCCTGCT TTCGAGGCGC CCACCGGCAT GCGTACCGAA CCCGATGCAC CGGCGCCATC CCTGTTCGGC AAAAAGAAAA ACTTTACCGG CGAGCCCATT GCCCTGGATT TTTACAAAAC CGACATTCGC AACGTGATTC GCATTCTCAA GGACGTAAGC 
GGCAAGAACT TTGCCATCGA CGATGATGTG TCGGGCAGCG TAACCTTAAG TTTTGTCAAC CCGGTCCCCT GGGACCAGGT GCTGGACCTG ATTCTTGAGA TGAACAATCT GGGTATGGTG GAGGCCGACG GCATCATTCG CATCGCCACC CAGGCCACCC TGGTGCAGCA GAAAGAGTCG GAAAAGGCGG CCCTGACAGC CCAGCAGGAC ATGAAAAAGG CCGAAGAGAC CCTGGCCTCC CTGGTCACCG AGTATTTCTC CATCAGCTAT GCCAACGCGG GTGAGGATAT TCTGCCCCAT ATCGAGGGCC TGCTGTCGGA CCGGGGCCAT GCCAAGGTGG ACAACCGCAC CAACCAGGTG ATCATGACCG ATGTGGAAGA GAAGGTGGAA AAGGCAAGGG AGATCATCGC CAAGATCGAC AAGGTGACCC CCCAGGTGAT GATCAAGGCC CGGATCGTGG AGACCAGTTC CAGCTTCTCC AGGGAGTTCG GCACGGAGTG GGGTATTGAC AACCGTTACA ACAGCCCCAA CATCAACATC GGCACCGACG GCGCCTACAG GGACGAGATG GGCGGCACCT ACACCTATGA TGTGGCCTTG AACAGCCTCT CCTCTCCGGT GCAGAACCTG ATCGGCATCA ACTTTGCCAG AATCATCGGC ACTCCCTTTT CCCTGGATGC CAAGCTCTCC CTGATGGAGT CCACCGGCGA CGTAAAGATC ATCTCCACAC CCAAGGTAGT GACCCTGGAC AATAAAACCG CCACCATCTC CCAGGGCATC GACTATCCCT ACACCGTGGT GGAAGATGGA GAGGCGGACG TCAAGTGGAA GACCATTGAC CTCAACCTGG ATGTGACGCC CCACGTGACC CCGGACGACC GAATCTCCAT GAAGCTCAAC ATTCAGAAGA ACGACGTGGG TGAGATCATC AACGGCGAGC AGTCTTTCAA CACCAAACGG GCATCCACCG AGCTGCTGGT CAATGACGGC GACACCGTGG TAATCGGCGG CATCATCAAG GAACGGGAAG GCGCGGGTGA GCGGGGCGTG CCCTGGATCT CCAAGATTCC GGTGCTGGGC AACCTGTTCA AATACAAGAC CCGGTCGGAT GAAAAAAGCG AACTCCTGAT TTTTATCACG CCCAATGTGG TGCGTCTGGA TTAG
|
Protein sequence | MHRMNAKKAA KVRGTLYFLS AILMVAMIAG CTPSAGVRQG TEATAPEATS DTGQLARRIT GISVKDTPDA VVVSIQTNHL ADYTLAEPPL EQAVVLYFPE AGLATTTADP AEPNDLVGTV TAAELIAGGP SKVVIPMQQP GLAYEAMRGQ DNSLNILFNK TAVSPAPEAA WGQEEAAETV AAAETKEDAS SVVEDATPAA PATLMEDITV TGDADALDIT ILADGAITDH KLRILKAPPR IVYDLPGIRS THAGEQRIAV DSAIAGRVRH FAHPDYLRVV VDLKDDLYLG KARAYSLSNG LLIHVGEKET PALAAARKTG PVTTEARTSV AEAASVEPVD ASPAVAPAPA EQEAEPAPAV AVKRSGKPAM VNRIDFMEEM DGRSAIEIGT TRPVDYEMLT ISGNQLFLKL DNTDILSYRQ RPLITTRFES AVDLILPVQT KKMKEQRFSA VNIDLREAVP YTIKQMDNTI RILFEPSSVA PNPAKEVVIP TELAIESLVT DTAGAPDIET AVTEPAPASE APVTPPETDV PAPIGTVAAP EAAVPQTAPA FEAPTGMRTE PDAPAPSLFG KKKNFTGEPI ALDFYKTDIR NVIRILKDVS GKNFAIDDDV SGSVTLSFVN PVPWDQVLDL ILEMNNLGMV EADGIIRIAT QATLVQQKES EKAALTAQQD MKKAEETLAS LVTEYFSISY ANAGEDILPH IEGLLSDRGH AKVDNRTNQV IMTDVEEKVE KAREIIAKID KVTPQVMIKA RIVETSSSFS REFGTEWGID NRYNSPNINI GTDGAYRDEM GGTYTYDVAL NSLSSPVQNL IGINFARIIG TPFSLDAKLS LMESTGDVKI ISTPKVVTLD NKTATISQGI DYPYTVVEDG EADVKWKTID LNLDVTPHVT PDDRISMKLN IQKNDVGEII NGEQSFNTKR ASTELLVNDG DTVVIGGIIK EREGAGERGV PWISKIPVLG NLFKYKTRSD EKSELLIFIT PNVVRLD
|
| |