Gene Elen_0914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0914 
Symbol 
ID8415204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1118273 
End bp1121404 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content67% 
IMG OID645023879 
ProductType IV secretory pathway VirB4 protein-like protein 
Protein accessionYP_003181276 
Protein GI257790670 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCA TCCAGTCCGC GCCGCCCGCG GCGCCTGCGC CCTCGCCGGC ATACCGAGCT 
GAGGACATCC CCGTGTCCGC GGCCCCGCAG CCGGCCGCCC GCGGGGCCCA CGAGCAGGTC
TCACAGGTCT CACAGAGCCC GGAGCCGCCG TTCGCATACG GCCGCGAGGA GGCCGCCGAA
TCCCCGGCAT CCCCCGCCCG CGACCCCTAC GAGGACTACT ACCGCCAGGC CCCGGCACGC
CCGAGCGCGC AGCCATACCG CGAGCCTGCG CCCCAACCGG CCCGCGACCC ATATGAGGAC
TACTACCGCC CCACCCCGGG CCGTGGCGGG CAGGGCCCTG CCGCCCGCGA CCCCGACGGG
GAGCCCGCCG CATCCGGCGA GTCCGACGCC TACGCCTACG GGAAGAACGC ATATGCGGAC
ACCTCGCGCG ACCCCTACGA GGACTACTGC CGCCAGAGCC AGACCCCGGC GCGCCCGAGC
GCACCGTCAT ACCGCGAGCC TGCGCCCCAG CCGGCCCGCG ACCCCTACAA TGACTACCGC
GACGCGGGCC GCGCAGATGA CGCGGACGGC CGCGGACAGC GCGGGCAGCG CGAGGAGAGC
CGCGAGGACG TCCGCGCCGA GGGCCGCGAC GCCCGCCGCC AGCAGAAGCT CAACGCGAAG
GCCGCGAAAG CCCGAAAGCG CCGCGAGCGC GCCGCGGAGA AGGCGGGAAA GAAGAAGGGC
ATGAAGCGCC TTCTGCCGAC CGAGGACTAC CCTGGCACCC TCGTCCTGCG CGCGGACAAG
TGCACGCAGG ACGCGCTCGG CTATGAGCTC CTGTGCGAGG ACGGCACCGT CAAGCTCCAG
CAGGGCGTCT ACAGCCGCGT CGTGGAGTTC CAGGACGCCA GCTTCCAGGC GGCGCGCGAG
TCCGAGCAGC GCGAGATCTA CGAGAACTGG TCCGAACTTC TCAACACGTT CGACAACACC
GTGCACCTCC AGGTCAAGAT CCTGTGCCGC GTCATCGACC GAGACGCGTT CCGCGAGGAC
ACCTTCCTGC CGCCCGTCGA GGGCGACTAC GCCGGCAACC GCTTCCGCCG CGACATCAAC
CAGATCATCG AGAGCAAGGT CGCCGAGACG CAGCAGAACG TCGAGAGGCG CCGGCTGTTC
ATCGTGACCG TCGAGGCGCC GACCTGCGAG CAGGCGTCCC CGCTGCTCGC CCGCGCGACC
GAGCAGGTCA TGCGCTCGCT CAAGAACATG GGGGTCAACT CGGAGGAGGT CCGCGGAAAC
GAGCTTCTGC GCATCATCGA CTCGATCACC AACCCGCGCG ACCCGCGCGG CTTCGTCTCG
TTCGAGGACC TGAAGGTCAC CGACGAGCGC GGCGTGAGCG CCATCCAGCT CGGCTACACC
ACCAAGGACC TCGTCGCGCC CGCCGACCTC ACCAAGATCG ACGACACCCA TATCTCATGG
AACGGCGTCA CGGGCCAGGC GCTCTACCTC CAGAAATGGG CGGGCTCGGT GCGCTCCGAC
ATGATCTCGA GCCTGGCGGA GCTGCCCATC AACCAGGTGA TCACGCTCGA CATGACGAGC
TGGGAGCAGT CGCGCGCCAT CGAGACCATC GAGTCGATGA ACACCGATCT CAAGGTGCAG
AAGTCCGACT ACGTGCTCAA GCACTCCCAG ACCATGTACA TCACCGACGA GATGCTGCCG
ACCAACCTCC AGGACGCCAT GGAGAACGCC CGCGACCTGC GCGACGACCT CGTGAGCCGC
GACCAGAAGA TGTGGTCGCT CACCTGCACC ACCATGACCT GGGCGGACTC GCTCGAGGAC
TGCGACGAGA ACTCCGGGGC CATCCAGGAC GTGTTCCGCC GATTCACCTG CCGCGCAGTG
CCCCTCGTGA AGCTCCAGCG ACAGGGTTTC GCCGCGATGC TGCCCACGGG CCGCTGCGAC
ATCCCCTACG TACGCAACCT CACGACCGCC CCGCTCGCGG CGCTCGTGCC GTTCACGTCG
GTCGAGCTCA TGGAGCGCGG CGGCATGTGG ATGGGCCAGA ACCAGACGTC CAAGAACTTC
ATCTTCTACA ACCGCCGCGA CGCCGTCGCG CCCAACGGCT TCATCCTGGG CAAGCCGGGC
CGCGGCAAGT CGGTGACGGC CAAGAACACG ATCCTGTGGA CGCTTCTCAC CGACCCCACC
GCCGAGGTCA TCGTGCTCGA CCCCGAGCGT GAGTACATCA ACGTCGCGCG CGAGATGGAC
GGCGAGGTCG TCCAGATCTC CGGAGACTCG CATACCTATA TCAACCCGTT CGATCTCGAG
CTCGTGGAGG GCGAACAGCC GCTCGCGATG AAGGTCGACG CCATCATGTC GATGGTCGAG
ATGATGGCCA AGAACCTCTC GCCGATGCAG AAGACCCTCG TGGACCGCTG CGTCTCGCGC
ATCTACGACC GCTACTTCGC CACCCACGAC CAGCGCGACA TCCCCACGCT GATCGACTTC
TACAACATGC TCAACCAGCA GCCCGAGCCC GAGGGACGCA TGCTCGCCGT CACCATCGAG
CGCTACGTCA CCGGCCAGGC CTCGCTGTTC AACCACCCGA CCAACGTGAA CACGCACAAG
CGCTTCGTCG TCTACGACAT CCGCGACTGC GCCGACAACA TGAAGGGCCT GGCGCTGCTG
ATCCTGCTCG ACCAGACGTG GAACCGCATC GTGCGCAACC GCGAGCGCCA CGTCCGCACC
TGGGTGTTCA TCGACGAGAT GCAGCTGCTC TTCGAGAACG ACTACGCGAT CTCGTACTTC
GACCAGCTCT GGACCCGCTC GCGCAAGTAC GGCGCCATCC CGACCGGCAT CACGCAGAAC
ATCGAGCGCA TCATCAACAA CGAGAAGAGC CGCCTGATGC TGGCTAACTC CGACTTCCTG
GTGCTGCTCG GCCAGTCCGC ATCGGACGCC GCGGCGCTGG GCGAGGTCAT CAAGCTCTCC
GAGCGCCAGG TCGCGATGAT GCGCAACGCC GGGCCCGGAG AGGGCCTGCT GGTCGCCGGC
GGCAAGATCA TCCCGTTCGA GAACCGCATC CCGACGGACT CGGCCATCTA CCGCATGGTC
ACCACCAAGC TCGACGACCT GATCCAGTAC TCCAACGAGG ACGGGCGCGG CGACGGCGCC
CGCCGCGGGT AG
 
Protein sequence
MPAIQSAPPA APAPSPAYRA EDIPVSAAPQ PAARGAHEQV SQVSQSPEPP FAYGREEAAE 
SPASPARDPY EDYYRQAPAR PSAQPYREPA PQPARDPYED YYRPTPGRGG QGPAARDPDG
EPAASGESDA YAYGKNAYAD TSRDPYEDYC RQSQTPARPS APSYREPAPQ PARDPYNDYR
DAGRADDADG RGQRGQREES REDVRAEGRD ARRQQKLNAK AAKARKRRER AAEKAGKKKG
MKRLLPTEDY PGTLVLRADK CTQDALGYEL LCEDGTVKLQ QGVYSRVVEF QDASFQAARE
SEQREIYENW SELLNTFDNT VHLQVKILCR VIDRDAFRED TFLPPVEGDY AGNRFRRDIN
QIIESKVAET QQNVERRRLF IVTVEAPTCE QASPLLARAT EQVMRSLKNM GVNSEEVRGN
ELLRIIDSIT NPRDPRGFVS FEDLKVTDER GVSAIQLGYT TKDLVAPADL TKIDDTHISW
NGVTGQALYL QKWAGSVRSD MISSLAELPI NQVITLDMTS WEQSRAIETI ESMNTDLKVQ
KSDYVLKHSQ TMYITDEMLP TNLQDAMENA RDLRDDLVSR DQKMWSLTCT TMTWADSLED
CDENSGAIQD VFRRFTCRAV PLVKLQRQGF AAMLPTGRCD IPYVRNLTTA PLAALVPFTS
VELMERGGMW MGQNQTSKNF IFYNRRDAVA PNGFILGKPG RGKSVTAKNT ILWTLLTDPT
AEVIVLDPER EYINVAREMD GEVVQISGDS HTYINPFDLE LVEGEQPLAM KVDAIMSMVE
MMAKNLSPMQ KTLVDRCVSR IYDRYFATHD QRDIPTLIDF YNMLNQQPEP EGRMLAVTIE
RYVTGQASLF NHPTNVNTHK RFVVYDIRDC ADNMKGLALL ILLDQTWNRI VRNRERHVRT
WVFIDEMQLL FENDYAISYF DQLWTRSRKY GAIPTGITQN IERIINNEKS RLMLANSDFL
VLLGQSASDA AALGEVIKLS ERQVAMMRNA GPGEGLLVAG GKIIPFENRI PTDSAIYRMV
TTKLDDLIQY SNEDGRGDGA RRG