Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1042 |
Symbol | |
ID | 5693877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1228042 |
End bp | 1229754 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641263639 |
Product | general secretory pathway protein E |
Protein accession | YP_001528929 |
Protein GI | 158521059 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.90473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCATGA GCAGAGATCT GTTACAGGCA CTGAAAACCC GCGCCGTTCT TTCCGAACAG CAGGCCGAGA AGATAGAACG GCTGATCAAT GGGGGCCGGG CCAATATTGG TGAGCTTCTT GTCAAGCGGA AGCTGGTGAC CGAACCGCAA CTGCTGTCCG CATACAGTGA GCTCTTCGGC ATTCCGGTCT GCGAGGCGGC CCAGCCGGAC GAGAGTGTGA TGGCCATCCC CGAAAAGGTG CCGGCCGCGT TTCTCAAACG GTATCTCATG GTGCCGCTGA AAATGTCGTC CCACGGCCAC GGGGCGGTCT CCGGTTGTCA GGTGGCGGTC AACGATCCGT CCCGGCTTCA TGCAGTGGAC GATCTGACAA GGCTGATGGG TGTAGAAACA TGCTCCCTGG TACTGGCCAC CCGGGACCAT ATTTTTTCCG CCGTGGGAAT GCTTTACGGC AGCGGAGACG GGGCGGCCGA AGAAATTGTC CGCGACATGG AGGGGGCCGA AGACATTCTT GACGAGCTCG ACGAGGCCGC TGACCTTCTG GACGATGCCA GTGACGCGCC TATTATCAAG CTGGTCAACC ATATCGTGGC CCAGTCGGTG AAGGCCGAGG CCAGTGACAT CCATATTGAG GCATACCAGG ACACCTTTAA GATACGGTAC CGGGTGGACG GCATCCTCTA TGATTTTCTG AGCCCGCCCA AGCGCATTCA GCCGGCCCTG GTCTCCCGCA TCAAGGTCAT GGCAAAAATG AACATCGCGG AAAAGCGCCT TCCCCAGGAC GGCCGCATGC AGGTCCGGCT GGGAGACAAG GAGGTGGATA TCCGCGTCTC TTCGATTCCC ATTACCGCCG GTGAGCGGCT GGTGCTTCGG CTGCTCAACA AGGCCAGCTC CATGCTGGGG TTGCAGGAGA TCGGTTTTTC CCGCCAGACC TTTGATCTGT TCAGCCGCCT GATCCGGTAT TCCAACGGCA TCATCCTGGT GACCGGCCCC ACCGGTTCAG GCAAGACCAC CACCCTTTAC GCCGCCCTTT CCACCATCAA CACGCCGGAC ATCAACATCA TCACCATTGA AGATCCGGTG GAGTACCAGG TCAGCGGCAT CAACCAGATC CAGGTGAATC AGAAGATCGG CCTGACCTTT GCCCGGGGCC TGCGCTCCAT TGTGCGGCAA GACCCGGATG TAATCCTCAT CGGCGAAATT CGGGACCAGG AGACGGCGGA TATCGCGGTG CAGTCTGCCC TGACCGGCCA CCTGGTCTTT TCCACGCTTC ACACCAATGA TTCGGCCAGC GCCGTCACCC GGCTGGTGGA TATCGGGGTG GAACCGTTTC TGCTGTCTTC CTCGGTAATC GCCGTGATCG CCCAGCGGCT GGTGCGGGTG CTGTGCCCCC GATGCAAGGA GGCCTATGTG CCGGATGACT CGGCCATTCT GAGTATCGGG GCCTCCGTCG AATTGTTTGC CGGAAAAACG ATTTACCGCA AAACGGGGTG TGATCACTGC CTGGGTACCG GCTACAGCGG CCGGATCGCG ATTTTCGAGA TACTGGTAAT GGATGAGAAG ATCAAGAAGA TGGTGCTGAC CACCCACGAT TCGGGACGGA TCGAGGCGGC GGCGGTCAAA CATGGGATGA TCACCCTTCG CCAGGACGGT ATTCACAAAG TGCTGGACGG GGTGACCAGC ATCGCCGAAG TGTTGCGGGT AACCCAGCGA TGA
|
Protein sequence | MPMSRDLLQA LKTRAVLSEQ QAEKIERLIN GGRANIGELL VKRKLVTEPQ LLSAYSELFG IPVCEAAQPD ESVMAIPEKV PAAFLKRYLM VPLKMSSHGH GAVSGCQVAV NDPSRLHAVD DLTRLMGVET CSLVLATRDH IFSAVGMLYG SGDGAAEEIV RDMEGAEDIL DELDEAADLL DDASDAPIIK LVNHIVAQSV KAEASDIHIE AYQDTFKIRY RVDGILYDFL SPPKRIQPAL VSRIKVMAKM NIAEKRLPQD GRMQVRLGDK EVDIRVSSIP ITAGERLVLR LLNKASSMLG LQEIGFSRQT FDLFSRLIRY SNGIILVTGP TGSGKTTTLY AALSTINTPD INIITIEDPV EYQVSGINQI QVNQKIGLTF ARGLRSIVRQ DPDVILIGEI RDQETADIAV QSALTGHLVF STLHTNDSAS AVTRLVDIGV EPFLLSSSVI AVIAQRLVRV LCPRCKEAYV PDDSAILSIG ASVELFAGKT IYRKTGCDHC LGTGYSGRIA IFEILVMDEK IKKMVLTTHD SGRIEAAAVK HGMITLRQDG IHKVLDGVTS IAEVLRVTQR
|
| |