Gene Dole_1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1408 
Symbol 
ID5694243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1674201 
End bp1675934 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content61% 
IMG OID641264001 
Producttype II secretion system protein E 
Protein accessionYP_001529289 
Protein GI158521419 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02019] bacteriochlorophyll 4-vinyl reductase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.22381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGAAG CAGGAGAACT GAGCGGCAAA CGCACGCGGA AAAAACTCGG CGAAATGCTG 
GTGGACGCCG GCTATCTGAC CGAGGAGCGA CTGACCGGCT ATGTGGCGGC CCAGAAGCGC
TCCGGTTTGA AGTTGGGCCA GTTCCTCATT CGGGAGGGAG TGGTCAGCGA ATCCATGATC
GTGGACCTGG TCTCCCGGCA GGCCGGTATT CAGCGGTTTG ACCCGGCCGA GTTTCCCGTG
ACCATGGAAC TGGCGAAGAG CCTGGCAGAA ACCGTGTCCC GCAAATACGG CGCGGTGCCT
TTGCGGCGGG GCAACCACCT CCTGCTGGTG GCCATGACCG ACCCCCTGGA CATTCGCTCC
CTGGATGCCA TTGAGGACGA GTGCGACCTG GAGGTGGAAC CCGTGATCTG CACGGAACAG
GAGTTCTCCC ACCTGTTTAC CCAGGTTTAC GGAACCCGCA TCGACGGTTT TGCCGGAGAG
GGATACGACC TGACCGAGAC CATGGACTAC GGAGAAGACG AGGAACCGGC AGACGCCGGG
GCCACGGAAA TATCCTCTTT GCAGCACATG GCCGAGGAGG CCCCGGTGGT GCGGCTGGTC
AACGCCCTGC TGGCCCAGGC GGTGCGGCAG GGTGCCAGTG ATATTCACAT CAGCCCGGAA
AAGCGCTACG TCCAGGTGCG GCTGCGTGTC GATGGTGTTC TGCACGAGGT GCCGGCCCCG
CCCAAGACCT TGTTTCTCTC GATTGTCTCC CGGCTCAAGA TTCTGGCCAA CCTGGACATC
TCGGTCTCCA GAATTCCCCA GGACGGCCGC TTTACCGTGA AGATCGAGAA CAAGGAGATC
AACATCCGGG TTTCCACCAT TCCCACCATT TACGGGGAAA ATGTGGTGCT GCGGCTGCTG
GACACCTCCG GCGGCGTCTA CACCCTGGAC CAGCTGGGCA TGGCTTCTGA AGACCAGGAA
AAGCTCAAGC GCAACATTCA GAAACCCTAC GGCATGATCC TGGCCACCGG GCCCACGGGC
AGCGGCAAAA GCACCTCCCT GTTTGCCATG ATCAACCGGA TCAACAAGCC GGACATCAAC
ATCATCACCC TGGAAGACCC GGTGGAGTAT CGTATCGAAA AGATCCGCCA GGCCCAGTTG
AACCGGCGGG CCGGCATGAC CTTTGCCAGC GGCCTGCGCT CCATTTTGCG CCAGGACCCG
GACGTGATTA TGGTGGGCGA AATTCGCGAC GGCGAAACCG CCCAGGTGGC CACCCAGGCG
GCCCTCACCG GCCACATGGT TTTCAGCACC GTTCACACCA ACGACGCGGC CGGCGCCATC
ACCCGGTTTA TCGACATGGG GGTGGAGCCC TTTTTGGTCT CGTCGGTGAT GCTGGTCTCC
ATGGCCCAGC GGCTGGTTCG CAAAGTGTGT GAAGACTGCG CCGAACCCTA CCAGCCGCCT
CATGAAGCGC TGGTCTTCAT GGGCCTGGAA AACGCGAAAA AAGCCACGTT CAAGCGGGGA
AAAGGATGCG CCTACTGCAT GAACACCGGA TACAGAGGCC GCACCGGCAT CTACGAGATT
CTGGAAATTG ATGACGATGT GCGGGAGATG ATCGTGTCTC GGGCCACTTC CCACCAGATC
ACAAGGGCCG CGGTTACTGC CGGCAAGCTT CAGACCCTGA AGCAGGACGC GGCCCGCAAG
GCGGCCCTGG GCATTACCAC TGTTGAGGAG GCGGCCAAGG GTGTTATGGG ATAG
 
Protein sequence
MPEAGELSGK RTRKKLGEML VDAGYLTEER LTGYVAAQKR SGLKLGQFLI REGVVSESMI 
VDLVSRQAGI QRFDPAEFPV TMELAKSLAE TVSRKYGAVP LRRGNHLLLV AMTDPLDIRS
LDAIEDECDL EVEPVICTEQ EFSHLFTQVY GTRIDGFAGE GYDLTETMDY GEDEEPADAG
ATEISSLQHM AEEAPVVRLV NALLAQAVRQ GASDIHISPE KRYVQVRLRV DGVLHEVPAP
PKTLFLSIVS RLKILANLDI SVSRIPQDGR FTVKIENKEI NIRVSTIPTI YGENVVLRLL
DTSGGVYTLD QLGMASEDQE KLKRNIQKPY GMILATGPTG SGKSTSLFAM INRINKPDIN
IITLEDPVEY RIEKIRQAQL NRRAGMTFAS GLRSILRQDP DVIMVGEIRD GETAQVATQA
ALTGHMVFST VHTNDAAGAI TRFIDMGVEP FLVSSVMLVS MAQRLVRKVC EDCAEPYQPP
HEALVFMGLE NAKKATFKRG KGCAYCMNTG YRGRTGIYEI LEIDDDVREM IVSRATSHQI
TRAAVTAGKL QTLKQDAARK AALGITTVEE AAKGVMG