Gene Information |
Locus tag | Elen_0914 |
Symbol | |
ID | 8415204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1118273 |
End bp | 1121404 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645023879 |
Product | Type IV secretory pathway VirB4 protein-like protein |
Protein accession | YP_003181276 |
Protein GI | 257790670 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
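The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon (both assumptions are consistent with the values in this record, but the record does not state them). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1118273, 1121404)
print(length_bp)                  # 3132
print(protein_length(length_bp))  # 1043
```

Both results agree with the Gene Length (3132 bp) and Protein Length (1043 aa) fields.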
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCCA TCCAGTCCGC GCCGCCCGCG GCGCCTGCGC CCTCGCCGGC ATACCGAGCT GAGGACATCC CCGTGTCCGC GGCCCCGCAG CCGGCCGCCC GCGGGGCCCA CGAGCAGGTC TCACAGGTCT CACAGAGCCC GGAGCCGCCG TTCGCATACG GCCGCGAGGA GGCCGCCGAA TCCCCGGCAT CCCCCGCCCG CGACCCCTAC GAGGACTACT ACCGCCAGGC CCCGGCACGC CCGAGCGCGC AGCCATACCG CGAGCCTGCG CCCCAACCGG CCCGCGACCC ATATGAGGAC TACTACCGCC CCACCCCGGG CCGTGGCGGG CAGGGCCCTG CCGCCCGCGA CCCCGACGGG GAGCCCGCCG CATCCGGCGA GTCCGACGCC TACGCCTACG GGAAGAACGC ATATGCGGAC ACCTCGCGCG ACCCCTACGA GGACTACTGC CGCCAGAGCC AGACCCCGGC GCGCCCGAGC GCACCGTCAT ACCGCGAGCC TGCGCCCCAG CCGGCCCGCG ACCCCTACAA TGACTACCGC GACGCGGGCC GCGCAGATGA CGCGGACGGC CGCGGACAGC GCGGGCAGCG CGAGGAGAGC CGCGAGGACG TCCGCGCCGA GGGCCGCGAC GCCCGCCGCC AGCAGAAGCT CAACGCGAAG GCCGCGAAAG CCCGAAAGCG CCGCGAGCGC GCCGCGGAGA AGGCGGGAAA GAAGAAGGGC ATGAAGCGCC TTCTGCCGAC CGAGGACTAC CCTGGCACCC TCGTCCTGCG CGCGGACAAG TGCACGCAGG ACGCGCTCGG CTATGAGCTC CTGTGCGAGG ACGGCACCGT CAAGCTCCAG CAGGGCGTCT ACAGCCGCGT CGTGGAGTTC CAGGACGCCA GCTTCCAGGC GGCGCGCGAG TCCGAGCAGC GCGAGATCTA CGAGAACTGG TCCGAACTTC TCAACACGTT CGACAACACC GTGCACCTCC AGGTCAAGAT CCTGTGCCGC GTCATCGACC GAGACGCGTT CCGCGAGGAC ACCTTCCTGC CGCCCGTCGA GGGCGACTAC GCCGGCAACC GCTTCCGCCG CGACATCAAC CAGATCATCG AGAGCAAGGT CGCCGAGACG CAGCAGAACG TCGAGAGGCG CCGGCTGTTC ATCGTGACCG TCGAGGCGCC GACCTGCGAG CAGGCGTCCC CGCTGCTCGC CCGCGCGACC GAGCAGGTCA TGCGCTCGCT CAAGAACATG GGGGTCAACT CGGAGGAGGT CCGCGGAAAC GAGCTTCTGC GCATCATCGA CTCGATCACC AACCCGCGCG ACCCGCGCGG CTTCGTCTCG TTCGAGGACC TGAAGGTCAC CGACGAGCGC GGCGTGAGCG CCATCCAGCT CGGCTACACC ACCAAGGACC TCGTCGCGCC CGCCGACCTC ACCAAGATCG ACGACACCCA TATCTCATGG AACGGCGTCA CGGGCCAGGC GCTCTACCTC CAGAAATGGG CGGGCTCGGT GCGCTCCGAC ATGATCTCGA GCCTGGCGGA GCTGCCCATC AACCAGGTGA TCACGCTCGA CATGACGAGC TGGGAGCAGT CGCGCGCCAT CGAGACCATC GAGTCGATGA ACACCGATCT CAAGGTGCAG AAGTCCGACT ACGTGCTCAA GCACTCCCAG ACCATGTACA TCACCGACGA GATGCTGCCG ACCAACCTCC AGGACGCCAT GGAGAACGCC CGCGACCTGC GCGACGACCT CGTGAGCCGC GACCAGAAGA TGTGGTCGCT CACCTGCACC ACCATGACCT GGGCGGACTC GCTCGAGGAC 
TGCGACGAGA ACTCCGGGGC CATCCAGGAC GTGTTCCGCC GATTCACCTG CCGCGCAGTG CCCCTCGTGA AGCTCCAGCG ACAGGGTTTC GCCGCGATGC TGCCCACGGG CCGCTGCGAC ATCCCCTACG TACGCAACCT CACGACCGCC CCGCTCGCGG CGCTCGTGCC GTTCACGTCG GTCGAGCTCA TGGAGCGCGG CGGCATGTGG ATGGGCCAGA ACCAGACGTC CAAGAACTTC ATCTTCTACA ACCGCCGCGA CGCCGTCGCG CCCAACGGCT TCATCCTGGG CAAGCCGGGC CGCGGCAAGT CGGTGACGGC CAAGAACACG ATCCTGTGGA CGCTTCTCAC CGACCCCACC GCCGAGGTCA TCGTGCTCGA CCCCGAGCGT GAGTACATCA ACGTCGCGCG CGAGATGGAC GGCGAGGTCG TCCAGATCTC CGGAGACTCG CATACCTATA TCAACCCGTT CGATCTCGAG CTCGTGGAGG GCGAACAGCC GCTCGCGATG AAGGTCGACG CCATCATGTC GATGGTCGAG ATGATGGCCA AGAACCTCTC GCCGATGCAG AAGACCCTCG TGGACCGCTG CGTCTCGCGC ATCTACGACC GCTACTTCGC CACCCACGAC CAGCGCGACA TCCCCACGCT GATCGACTTC TACAACATGC TCAACCAGCA GCCCGAGCCC GAGGGACGCA TGCTCGCCGT CACCATCGAG CGCTACGTCA CCGGCCAGGC CTCGCTGTTC AACCACCCGA CCAACGTGAA CACGCACAAG CGCTTCGTCG TCTACGACAT CCGCGACTGC GCCGACAACA TGAAGGGCCT GGCGCTGCTG ATCCTGCTCG ACCAGACGTG GAACCGCATC GTGCGCAACC GCGAGCGCCA CGTCCGCACC TGGGTGTTCA TCGACGAGAT GCAGCTGCTC TTCGAGAACG ACTACGCGAT CTCGTACTTC GACCAGCTCT GGACCCGCTC GCGCAAGTAC GGCGCCATCC CGACCGGCAT CACGCAGAAC ATCGAGCGCA TCATCAACAA CGAGAAGAGC CGCCTGATGC TGGCTAACTC CGACTTCCTG GTGCTGCTCG GCCAGTCCGC ATCGGACGCC GCGGCGCTGG GCGAGGTCAT CAAGCTCTCC GAGCGCCAGG TCGCGATGAT GCGCAACGCC GGGCCCGGAG AGGGCCTGCT GGTCGCCGGC GGCAAGATCA TCCCGTTCGA GAACCGCATC CCGACGGACT CGGCCATCTA CCGCATGGTC ACCACCAAGC TCGACGACCT GATCCAGTAC TCCAACGAGG ACGGGCGCGG CGACGGCGCC CGCCGCGGGT AG
|
Protein sequence | MPAIQSAPPA APAPSPAYRA EDIPVSAAPQ PAARGAHEQV SQVSQSPEPP FAYGREEAAE SPASPARDPY EDYYRQAPAR PSAQPYREPA PQPARDPYED YYRPTPGRGG QGPAARDPDG EPAASGESDA YAYGKNAYAD TSRDPYEDYC RQSQTPARPS APSYREPAPQ PARDPYNDYR DAGRADDADG RGQRGQREES REDVRAEGRD ARRQQKLNAK AAKARKRRER AAEKAGKKKG MKRLLPTEDY PGTLVLRADK CTQDALGYEL LCEDGTVKLQ QGVYSRVVEF QDASFQAARE SEQREIYENW SELLNTFDNT VHLQVKILCR VIDRDAFRED TFLPPVEGDY AGNRFRRDIN QIIESKVAET QQNVERRRLF IVTVEAPTCE QASPLLARAT EQVMRSLKNM GVNSEEVRGN ELLRIIDSIT NPRDPRGFVS FEDLKVTDER GVSAIQLGYT TKDLVAPADL TKIDDTHISW NGVTGQALYL QKWAGSVRSD MISSLAELPI NQVITLDMTS WEQSRAIETI ESMNTDLKVQ KSDYVLKHSQ TMYITDEMLP TNLQDAMENA RDLRDDLVSR DQKMWSLTCT TMTWADSLED CDENSGAIQD VFRRFTCRAV PLVKLQRQGF AAMLPTGRCD IPYVRNLTTA PLAALVPFTS VELMERGGMW MGQNQTSKNF IFYNRRDAVA PNGFILGKPG RGKSVTAKNT ILWTLLTDPT AEVIVLDPER EYINVAREMD GEVVQISGDS HTYINPFDLE LVEGEQPLAM KVDAIMSMVE MMAKNLSPMQ KTLVDRCVSR IYDRYFATHD QRDIPTLIDF YNMLNQQPEP EGRMLAVTIE RYVTGQASLF NHPTNVNTHK RFVVYDIRDC ADNMKGLALL ILLDQTWNRI VRNRERHVRT WVFIDEMQLL FENDYAISYF DQLWTRSRKY GAIPTGITQN IERIINNEKS RLMLANSDFL VLLGQSASDA AALGEVIKLS ERQVAMMRNA GPGEGLLVAG GKIIPFENRI PTDSAIYRMV TTKLDDLIQY SNEDGRGDGA RRG
|
| |
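The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch, with hypothetical helper names, of how the blocks might be joined and how a GC percentage such as the one in the GC content field could be computed. It is shown on the first two blocks of the gene sequence only, not on the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
fragment = clean_sequence("ATGCCCGCCA TCCAGTCCGC")
print(len(fragment))                # 20
print(fragment.startswith("ATG"))   # True
print(gc_percent(fragment))         # 70.0
```

Applied to the full gene sequence, the same functions would be expected to give a length matching the Gene Length field and a GC percentage close to the reported value; that result is not asserted here.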