Gene Dole_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1042 
Symbol 
ID5693877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1228042 
End bp1229754 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content59% 
IMG OID641263639 
Productgeneral secretory pathway protein E 
Protein accessionYP_001528929 
Protein GI158521059 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.90473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATGA GCAGAGATCT GTTACAGGCA CTGAAAACCC GCGCCGTTCT TTCCGAACAG 
CAGGCCGAGA AGATAGAACG GCTGATCAAT GGGGGCCGGG CCAATATTGG TGAGCTTCTT
GTCAAGCGGA AGCTGGTGAC CGAACCGCAA CTGCTGTCCG CATACAGTGA GCTCTTCGGC
ATTCCGGTCT GCGAGGCGGC CCAGCCGGAC GAGAGTGTGA TGGCCATCCC CGAAAAGGTG
CCGGCCGCGT TTCTCAAACG GTATCTCATG GTGCCGCTGA AAATGTCGTC CCACGGCCAC
GGGGCGGTCT CCGGTTGTCA GGTGGCGGTC AACGATCCGT CCCGGCTTCA TGCAGTGGAC
GATCTGACAA GGCTGATGGG TGTAGAAACA TGCTCCCTGG TACTGGCCAC CCGGGACCAT
ATTTTTTCCG CCGTGGGAAT GCTTTACGGC AGCGGAGACG GGGCGGCCGA AGAAATTGTC
CGCGACATGG AGGGGGCCGA AGACATTCTT GACGAGCTCG ACGAGGCCGC TGACCTTCTG
GACGATGCCA GTGACGCGCC TATTATCAAG CTGGTCAACC ATATCGTGGC CCAGTCGGTG
AAGGCCGAGG CCAGTGACAT CCATATTGAG GCATACCAGG ACACCTTTAA GATACGGTAC
CGGGTGGACG GCATCCTCTA TGATTTTCTG AGCCCGCCCA AGCGCATTCA GCCGGCCCTG
GTCTCCCGCA TCAAGGTCAT GGCAAAAATG AACATCGCGG AAAAGCGCCT TCCCCAGGAC
GGCCGCATGC AGGTCCGGCT GGGAGACAAG GAGGTGGATA TCCGCGTCTC TTCGATTCCC
ATTACCGCCG GTGAGCGGCT GGTGCTTCGG CTGCTCAACA AGGCCAGCTC CATGCTGGGG
TTGCAGGAGA TCGGTTTTTC CCGCCAGACC TTTGATCTGT TCAGCCGCCT GATCCGGTAT
TCCAACGGCA TCATCCTGGT GACCGGCCCC ACCGGTTCAG GCAAGACCAC CACCCTTTAC
GCCGCCCTTT CCACCATCAA CACGCCGGAC ATCAACATCA TCACCATTGA AGATCCGGTG
GAGTACCAGG TCAGCGGCAT CAACCAGATC CAGGTGAATC AGAAGATCGG CCTGACCTTT
GCCCGGGGCC TGCGCTCCAT TGTGCGGCAA GACCCGGATG TAATCCTCAT CGGCGAAATT
CGGGACCAGG AGACGGCGGA TATCGCGGTG CAGTCTGCCC TGACCGGCCA CCTGGTCTTT
TCCACGCTTC ACACCAATGA TTCGGCCAGC GCCGTCACCC GGCTGGTGGA TATCGGGGTG
GAACCGTTTC TGCTGTCTTC CTCGGTAATC GCCGTGATCG CCCAGCGGCT GGTGCGGGTG
CTGTGCCCCC GATGCAAGGA GGCCTATGTG CCGGATGACT CGGCCATTCT GAGTATCGGG
GCCTCCGTCG AATTGTTTGC CGGAAAAACG ATTTACCGCA AAACGGGGTG TGATCACTGC
CTGGGTACCG GCTACAGCGG CCGGATCGCG ATTTTCGAGA TACTGGTAAT GGATGAGAAG
ATCAAGAAGA TGGTGCTGAC CACCCACGAT TCGGGACGGA TCGAGGCGGC GGCGGTCAAA
CATGGGATGA TCACCCTTCG CCAGGACGGT ATTCACAAAG TGCTGGACGG GGTGACCAGC
ATCGCCGAAG TGTTGCGGGT AACCCAGCGA TGA
 
Protein sequence
MPMSRDLLQA LKTRAVLSEQ QAEKIERLIN GGRANIGELL VKRKLVTEPQ LLSAYSELFG 
IPVCEAAQPD ESVMAIPEKV PAAFLKRYLM VPLKMSSHGH GAVSGCQVAV NDPSRLHAVD
DLTRLMGVET CSLVLATRDH IFSAVGMLYG SGDGAAEEIV RDMEGAEDIL DELDEAADLL
DDASDAPIIK LVNHIVAQSV KAEASDIHIE AYQDTFKIRY RVDGILYDFL SPPKRIQPAL
VSRIKVMAKM NIAEKRLPQD GRMQVRLGDK EVDIRVSSIP ITAGERLVLR LLNKASSMLG
LQEIGFSRQT FDLFSRLIRY SNGIILVTGP TGSGKTTTLY AALSTINTPD INIITIEDPV
EYQVSGINQI QVNQKIGLTF ARGLRSIVRQ DPDVILIGEI RDQETADIAV QSALTGHLVF
STLHTNDSAS AVTRLVDIGV EPFLLSSSVI AVIAQRLVRV LCPRCKEAYV PDDSAILSIG
ASVELFAGKT IYRKTGCDHC LGTGYSGRIA IFEILVMDEK IKKMVLTTHD SGRIEAAAVK
HGMITLRQDG IHKVLDGVTS IAEVLRVTQR