Gene Dole_0329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0329 
Symbol 
ID5693148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp376692 
End bp379655 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content59% 
IMG OID641262910 
Producttype IV pilus secretin PilQ 
Protein accessionYP_001528216 
Protein GI158520346 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4796] Type II secretory pathway, component HofQ 
TIGRFAM ID[TIGR02515] type IV pilus secretin (or competence protein) PilQ 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00519282 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACAGGA TGAACGCAAA AAAAGCGGCA AAAGTTCGCG GTACTCTTTA TTTTCTGAGC 
GCCATTTTGA TGGTGGCGAT GATCGCGGGA TGCACTCCCT CCGCAGGGGT CCGGCAGGGC
ACCGAGGCTA CGGCACCGGA AGCAACCTCT GACACCGGGC AATTGGCCAG ACGTATTACC
GGCATATCAG TCAAGGATAC GCCGGATGCC GTGGTGGTCT CTATTCAGAC CAATCACCTG
GCAGACTATA CGCTGGCGGA ACCGCCCCTG GAGCAGGCCG TGGTGCTCTA TTTTCCAGAG
GCCGGCCTGG CCACGACCAC GGCCGATCCG GCGGAACCCA ACGATCTGGT GGGCACCGTT
ACCGCTGCGG AGCTGATTGC CGGGGGGCCT TCAAAGGTCG TTATTCCCAT GCAGCAGCCC
GGCCTGGCCT ACGAGGCGAT GCGGGGCCAG GACAACAGCC TGAATATTTT ATTTAACAAG
ACGGCGGTTT CCCCGGCACC GGAAGCAGCC TGGGGACAGG AAGAGGCCGC GGAAACGGTT
GCTGCGGCAG AAACAAAAGA GGATGCGTCT TCTGTTGTTG AGGACGCCAC ACCCGCGGCA
CCCGCCACCC TGATGGAAGA CATCACGGTG ACCGGCGATG CCGATGCCCT GGATATAACG
ATTCTGGCCG ACGGCGCCAT CACCGATCAT AAACTGAGGA TACTCAAGGC CCCGCCCCGT
ATCGTTTACG ACCTGCCCGG CATTCGAAGC ACCCATGCGG GCGAACAGCG CATTGCCGTG
GATTCGGCCA TTGCCGGCCG CGTGCGCCAT TTTGCCCACC CCGACTACCT GCGGGTGGTA
GTAGACTTAA AGGATGACCT GTATCTTGGC AAAGCCAGGG CTTACAGCTT AAGCAACGGC
TTGCTGATTC ACGTGGGGGA GAAAGAAACC CCGGCCCTGG CCGCGGCCCG AAAGACCGGA
CCGGTGACCA CCGAAGCCCG GACCTCTGTT GCGGAGGCCG CTTCCGTTGA ACCTGTTGAC
GCCTCCCCGG CGGTTGCACC GGCACCTGCG GAGCAAGAAG CCGAGCCGGC CCCCGCCGTG
GCGGTCAAGC GCTCCGGCAA GCCGGCCATG GTCAACCGCA TCGACTTTAT GGAAGAGATG
GACGGCCGGT CCGCCATTGA AATCGGCACC ACCCGGCCGG TGGATTATGA GATGCTGACC
ATTTCCGGCA ACCAGCTCTT TCTGAAGCTG GACAACACCG ACATTCTCAG CTATCGTCAG
CGGCCCCTTA TCACCACCCG GTTTGAAAGC GCGGTGGACC TGATTTTGCC GGTACAGACA
AAAAAAATGA AAGAACAACG GTTTTCCGCG GTCAACATCG ACCTGCGGGA GGCGGTGCCC
TACACCATAA AACAGATGGA TAACACCATC CGCATTCTGT TTGAACCCTC TTCCGTTGCC
CCGAACCCGG CAAAAGAGGT GGTGATTCCC ACGGAACTGG CCATTGAGTC CCTGGTGACC
GACACCGCCG GAGCGCCGGA TATCGAGACA GCGGTCACGG AACCGGCCCC GGCATCTGAA
GCGCCGGTGA CGCCTCCGGA AACCGACGTA CCGGCCCCGA TTGGAACCGT TGCGGCGCCT
GAAGCCGCGG TTCCCCAGAC CGCGCCTGCT TTCGAGGCGC CCACCGGCAT GCGTACCGAA
CCCGATGCAC CGGCGCCATC CCTGTTCGGC AAAAAGAAAA ACTTTACCGG CGAGCCCATT
GCCCTGGATT TTTACAAAAC CGACATTCGC AACGTGATTC GCATTCTCAA GGACGTAAGC
GGCAAGAACT TTGCCATCGA CGATGATGTG TCGGGCAGCG TAACCTTAAG TTTTGTCAAC
CCGGTCCCCT GGGACCAGGT GCTGGACCTG ATTCTTGAGA TGAACAATCT GGGTATGGTG
GAGGCCGACG GCATCATTCG CATCGCCACC CAGGCCACCC TGGTGCAGCA GAAAGAGTCG
GAAAAGGCGG CCCTGACAGC CCAGCAGGAC ATGAAAAAGG CCGAAGAGAC CCTGGCCTCC
CTGGTCACCG AGTATTTCTC CATCAGCTAT GCCAACGCGG GTGAGGATAT TCTGCCCCAT
ATCGAGGGCC TGCTGTCGGA CCGGGGCCAT GCCAAGGTGG ACAACCGCAC CAACCAGGTG
ATCATGACCG ATGTGGAAGA GAAGGTGGAA AAGGCAAGGG AGATCATCGC CAAGATCGAC
AAGGTGACCC CCCAGGTGAT GATCAAGGCC CGGATCGTGG AGACCAGTTC CAGCTTCTCC
AGGGAGTTCG GCACGGAGTG GGGTATTGAC AACCGTTACA ACAGCCCCAA CATCAACATC
GGCACCGACG GCGCCTACAG GGACGAGATG GGCGGCACCT ACACCTATGA TGTGGCCTTG
AACAGCCTCT CCTCTCCGGT GCAGAACCTG ATCGGCATCA ACTTTGCCAG AATCATCGGC
ACTCCCTTTT CCCTGGATGC CAAGCTCTCC CTGATGGAGT CCACCGGCGA CGTAAAGATC
ATCTCCACAC CCAAGGTAGT GACCCTGGAC AATAAAACCG CCACCATCTC CCAGGGCATC
GACTATCCCT ACACCGTGGT GGAAGATGGA GAGGCGGACG TCAAGTGGAA GACCATTGAC
CTCAACCTGG ATGTGACGCC CCACGTGACC CCGGACGACC GAATCTCCAT GAAGCTCAAC
ATTCAGAAGA ACGACGTGGG TGAGATCATC AACGGCGAGC AGTCTTTCAA CACCAAACGG
GCATCCACCG AGCTGCTGGT CAATGACGGC GACACCGTGG TAATCGGCGG CATCATCAAG
GAACGGGAAG GCGCGGGTGA GCGGGGCGTG CCCTGGATCT CCAAGATTCC GGTGCTGGGC
AACCTGTTCA AATACAAGAC CCGGTCGGAT GAAAAAAGCG AACTCCTGAT TTTTATCACG
CCCAATGTGG TGCGTCTGGA TTAG
 
Protein sequence
MHRMNAKKAA KVRGTLYFLS AILMVAMIAG CTPSAGVRQG TEATAPEATS DTGQLARRIT 
GISVKDTPDA VVVSIQTNHL ADYTLAEPPL EQAVVLYFPE AGLATTTADP AEPNDLVGTV
TAAELIAGGP SKVVIPMQQP GLAYEAMRGQ DNSLNILFNK TAVSPAPEAA WGQEEAAETV
AAAETKEDAS SVVEDATPAA PATLMEDITV TGDADALDIT ILADGAITDH KLRILKAPPR
IVYDLPGIRS THAGEQRIAV DSAIAGRVRH FAHPDYLRVV VDLKDDLYLG KARAYSLSNG
LLIHVGEKET PALAAARKTG PVTTEARTSV AEAASVEPVD ASPAVAPAPA EQEAEPAPAV
AVKRSGKPAM VNRIDFMEEM DGRSAIEIGT TRPVDYEMLT ISGNQLFLKL DNTDILSYRQ
RPLITTRFES AVDLILPVQT KKMKEQRFSA VNIDLREAVP YTIKQMDNTI RILFEPSSVA
PNPAKEVVIP TELAIESLVT DTAGAPDIET AVTEPAPASE APVTPPETDV PAPIGTVAAP
EAAVPQTAPA FEAPTGMRTE PDAPAPSLFG KKKNFTGEPI ALDFYKTDIR NVIRILKDVS
GKNFAIDDDV SGSVTLSFVN PVPWDQVLDL ILEMNNLGMV EADGIIRIAT QATLVQQKES
EKAALTAQQD MKKAEETLAS LVTEYFSISY ANAGEDILPH IEGLLSDRGH AKVDNRTNQV
IMTDVEEKVE KAREIIAKID KVTPQVMIKA RIVETSSSFS REFGTEWGID NRYNSPNINI
GTDGAYRDEM GGTYTYDVAL NSLSSPVQNL IGINFARIIG TPFSLDAKLS LMESTGDVKI
ISTPKVVTLD NKTATISQGI DYPYTVVEDG EADVKWKTID LNLDVTPHVT PDDRISMKLN
IQKNDVGEII NGEQSFNTKR ASTELLVNDG DTVVIGGIIK EREGAGERGV PWISKIPVLG
NLFKYKTRSD EKSELLIFIT PNVVRLD