Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0791 |
Symbol | |
ID | 8415081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 984283 |
End bp | 986730 |
Gene Length | 2448 bp |
Protein Length | 815 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023757 |
Product | Type IV secretory pathway VirB4 protein-like protein |
Protein accession | YP_003181154 |
Protein GI | 257790548 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGTAC AGGGCGCGGC GTCGCGCCTG CTCGCCGCGC TGGCCAGCAG GAGGTCGAAG GAAAAGGACC CCGGCCGCGA CGAGGCCCGC GCCCGCAGCA GCAAAAAGCG CTGCTCAAGC GCAGACGCGG CGAGCTTCAT CGCCTACGAC GCGCTGTACA AAGACGGCAT CGCCGAGGTC GAGCCCGGCC TATTCAGCCA GACCGTCGAA TTCTCCGACA TCAGCTACCA ATCCGCCCGA AAGGAAACCC AAGAGCAGAT CTTCACCACG CTGAGCTCGC TCTACAACTA CTTTCCGGCG GAATCGAGCG TGCAGCTGAC CATCGTGAAC ACCCCGATCC CGCGAGAGCT CATCGGGAAG AAGGTGTTCT TCGAGCCGAA CGACGAGCGT ACCCTGGGCT ACGTCGACGA GTACAACCGA ATCCTCAACG ACAAGATGTG CGAGGGCGTG TCGAATCTCG TCCGCAGGCG CTTCCTCACG TACTCGGTGC CTGCCGAGAG CGTGGAGGCT GCTGTGCCCA AGCTCTCGCA CATCAGGTCG GACATCGCGT CGACCTTGGC ACGCATCCGT TGCGATGCGG CGCCGCTTGG CGGTCAGGAA AGGCTTGAAG CAATCTCGAC GCTCGTTCGC CCGGGGCACA CGCCGTCGTG CGACTGGGAC AAGCTGTCCC AGCACCCGCG GCTGCGAACG AAGGACCTTA TCGCGCCGAA CCTCATGGAC TTCGCCCCCG CCGGGCGCGC CGACGCCGTC GAGATAGACG GCATGTACGC CAGCGTGCTC ACGCTGCGCG ATTTCGGAAG CGTGCTCGAA GACCACTACC TGGCTTCGAT CATCGATCTG CCGATACCGC TAGCCGTGTC GGTGCACATC GCGCCGATCG TGCAGTCGGA GGCGATCGCG CTCGTCAAGC GCCAGATCGA CTGGATGGAC AAGGAGATAA TCGACGAGCA GACGACGGCG GCGAAGAAGG GCTTCAGCCA ATCGATCCTC ACGCCAGAGA TCCGCTACGC CAAGGAGGAG GCCGAAGAGC TCCTCGACTT CCTCCGCAAC AAGAACGAGC ACCTCTTCGA GTACACGGGG CTCATCTACA CCTACGCCGA CTCGCTTGAG GCCCTCGACC GCCAGGTCGC GCAGGTGAAG TCCATCGCAC AGGGCAACTC CATCGCCGTG GCAAGCCTCG ATTTCCGCCA GCGCCCCGCG CTCAACTCGG TCCTTCCGCT CGGACGAAAC CACCTTGAGT TCACGCGCTA CCTGACCACG GGACAGATAG CCATGCAGAT GCCGTTCGCA TCGCTCGAGC TGAACCAGGA AGGCGGCGGC TACTACGGGC AGTCGAAGCA GTCCGGCAAC CTCGTGATCT GCAACAGGAA GCTCCTGGCA TCCCCAATGG GGTTCGTCTG CGGCAAGCCG GGCTCGGGCA AATCGTTCAG CGTCAAGCGA GAGATAACCA ACACCGTGCT CGCGCATCCC GAGGACCAGA TCATCGTGTT CGACCCGGCC GGCGAGTACC CGACCCTCAT CGAGGCGCTC GGCGGATCGA ACATCGCGTT CTCCCCAGAT TCGCCCACGC GCTTGAGCCC GTTCGACCGC TCGGACGTCG CCAACATGGC CACGACCAGC CAGATGGCGT TCAAGATCGA CGCCTTCCTG GCCCTTTCGT CGGCGATGAT GGCAGAGGGC GACGAGGGTT TGCCGGAGGC CGACAAGTCG ATAATCACGC GCTGCGTCGA GGCGGCCTAC GCTAAGCGCG GCGGCGACGG CACCCCGACG CTTTCCGACT TCTATGAAAT CTTGAAAGCC CAGCCCGAGC GGGAGTCCCG CGACATCGCG CTTCGCTACG AGCGCTATGT GAGCGGGCCG CTCTCGTTCT TCAACTGCCA GAGCAACGTC ACCTTCGACG AGCGCATCGT CAACGTCGAC CTCCACGAGC TGTCGAGCAG CATGCGCGTG TTCGGCATGC TCACCGCGCT GGAGGCCGTG CGAAACCGCA TGTACGCGAA CTACGAGCGC GGCGTGACTA CGTGGCTCTA CATCGACGAG GTGCAGTCCC TCTTCGGGCA CCCAGCGATC ATCGAGTACT TCACGAAGTT CTGGGCGGAG GGCCGCAAGT TCAACCTCAT CGCAACCGGC ATCACCCAGA ACTCGACCTA CATGCTGGCG CACGAGGACG CGCGCAACAT GGTGCTGAAC TCCGATTTCG TGCTGCTGCA CAAGCAGTCG GCGCTCGACA GGAAGAGCTG GGTCGACCTG CTGTCGCTTT CCGCCACCGA GGCGGGCTAC ATCGACGACA CCGTCAAGGC CGGCGAAGGG CTGCTCGTGG CCGGCGGGGT GCGGGTCCCC ATCCGCGACG ACTTCCCGAA GGGCGCGCTC TACGACCTGT TCAACACGAA AGCCACCGAG ATCGCCGAGC TGAAGCGAAA GGCGCGCCGC GAGGGAGCCG GCCTATGA
|
Protein sequence | MRVQGAASRL LAALASRRSK EKDPGRDEAR ARSSKKRCSS ADAASFIAYD ALYKDGIAEV EPGLFSQTVE FSDISYQSAR KETQEQIFTT LSSLYNYFPA ESSVQLTIVN TPIPRELIGK KVFFEPNDER TLGYVDEYNR ILNDKMCEGV SNLVRRRFLT YSVPAESVEA AVPKLSHIRS DIASTLARIR CDAAPLGGQE RLEAISTLVR PGHTPSCDWD KLSQHPRLRT KDLIAPNLMD FAPAGRADAV EIDGMYASVL TLRDFGSVLE DHYLASIIDL PIPLAVSVHI APIVQSEAIA LVKRQIDWMD KEIIDEQTTA AKKGFSQSIL TPEIRYAKEE AEELLDFLRN KNEHLFEYTG LIYTYADSLE ALDRQVAQVK SIAQGNSIAV ASLDFRQRPA LNSVLPLGRN HLEFTRYLTT GQIAMQMPFA SLELNQEGGG YYGQSKQSGN LVICNRKLLA SPMGFVCGKP GSGKSFSVKR EITNTVLAHP EDQIIVFDPA GEYPTLIEAL GGSNIAFSPD SPTRLSPFDR SDVANMATTS QMAFKIDAFL ALSSAMMAEG DEGLPEADKS IITRCVEAAY AKRGGDGTPT LSDFYEILKA QPERESRDIA LRYERYVSGP LSFFNCQSNV TFDERIVNVD LHELSSSMRV FGMLTALEAV RNRMYANYER GVTTWLYIDE VQSLFGHPAI IEYFTKFWAE GRKFNLIATG ITQNSTYMLA HEDARNMVLN SDFVLLHKQS ALDRKSWVDL LSLSATEAGY IDDTVKAGEG LLVAGGVRVP IRDDFPKGAL YDLFNTKATE IAELKRKARR EGAGL
|
| |