Gene Dole_0333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0333 
Symbol 
ID5693152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp381822 
End bp382877 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content52% 
IMG OID641262914 
Producttype IV pilus assembly protein PilM 
Protein accessionYP_001528220 
Protein GI158520350 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4972] Tfp pilus assembly protein, ATPase PilM 
TIGRFAM ID[TIGR01175] type IV pilus assembly protein PilM 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.872889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTTCG GAAAAAAGGA TCATCTGGTG GGGCTTGACA TCGGCTCCAG CGTGGTCAAG 
GTCGCGGAGG TGGCCGTCTC CTCCTCTGGT CGAAGCCTGC TGCGGTTCGG CACCCTGGAA
ATGCCGGCAG GCGCCATTGT GGAGGAGGGC ATCAAGGATC CGGAGGTGGT GGCCGCCACC
ATAAAAGAGC TTTTTTCACT CTATAATATA AAGGAAGACC GGGTGGCGAT CTCCATCGGC
GGATACTCGG TAATTGTCAA AAAGATCAAC GTGCAGAGCA TGAGTGAAGA GCAGCTTCAG
GAAGTGATCG CCGTTGAGGC GGAGCAGTAT ATCCCTTTTG ACATCAACAA CGTGAATTTG
GATTTCCAGA TTCTGGGCGA CAATGACCAG AACCCCAACC AGATCGATGT GATGCTGGTG
GCGGCCAAAA AAGAGACGGT CAACGATTAC CTTAACGTCA TCGATATCGC CGGGTTGACG
CCGGTGATCA TTGATGTGGA CGCCTTTGCC CTTCAAAACA TTTACGAAGT CAACTATGAG
GCCGAGGAGA ACTGCGTTGC CCTCATCGAC ATCGGGGCCA ACAAGACGTC ACTCAATATT
CTGAAGGGCA GCAAATCGGT ATTCATGCGG GATGTCTCCT TTGGGTGCAA TCAGATCAAT
CATCATATCG CCACCAAAAT CAACTGCAGC CTGGAAGAAG CAGAAGAGCT GAAGCTCAGC
GACAAGCAGG AGCGCATCTC AACCGACGAG CTTAACAGCA TCATCTCTTC GGTGGTGTCC
GACTGGACCA CCGAAATCCG GCGGGCCCTG GATTTCTTTT ATTCCACCTA CTCCAACGAT
CATCTCAAGT GTATATATTT AAGCGGCGGC GGATCCAACA TCCCCGAATT CAGGCAAATG
CTGGCGTCCC AAACCTCCTC GGAGCTGGAA ATACTTAATC CTTTCAGCAA CTTCGACGTC
AAGGACGACC GGCTGGATTC GGCTTACCTG AAACAAATCG CACCCCAGGC AGCCATCTGC
ATGGGGATTG CCATCAGACG GGTGGATGAC AAATGA
 
Protein sequence
MLFGKKDHLV GLDIGSSVVK VAEVAVSSSG RSLLRFGTLE MPAGAIVEEG IKDPEVVAAT 
IKELFSLYNI KEDRVAISIG GYSVIVKKIN VQSMSEEQLQ EVIAVEAEQY IPFDINNVNL
DFQILGDNDQ NPNQIDVMLV AAKKETVNDY LNVIDIAGLT PVIIDVDAFA LQNIYEVNYE
AEENCVALID IGANKTSLNI LKGSKSVFMR DVSFGCNQIN HHIATKINCS LEEAEELKLS
DKQERISTDE LNSIISSVVS DWTTEIRRAL DFFYSTYSND HLKCIYLSGG GSNIPEFRQM
LASQTSSELE ILNPFSNFDV KDDRLDSAYL KQIAPQAAIC MGIAIRRVDD K