Gene Elen_0918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0918 
Symbol 
ID8415208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1123517 
End bp1125550 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content63% 
IMG OID645023883 
ProductTRAG family protein 
Protein accessionYP_003181280 
Protein GI257790674 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACCC TCAAGGGATT TCTGTACCTC ATCCTCGGCG GCACGCTCTG CGGCTGGATG 
TTCAACCGAA TCGCCGCATG GTTCATCGAC AACCCGCTCA CCGTCGGCTC CAACCACACC
GTCGCCGAGT GGGCCGTGAT CCTGCAAGAC CCGTTCTACC TCGACTCCCG CACGATGCCG
TTCTTCTTTC TGGCCTTCGG CGCCATCGCG CTGGTGGCCA TGACAAAATA CGACTGGACC
GGCGAGCGCG AGGAGCAGAA GAAGCTCCGC GGCGAGGAGT ACGGCAACCA GCGCTGGGCG
CGCGACGACG AGATGCAGCA GTTCGCGCAC ACCTCGACCG TGAAGCGCGT GCCCATCCGC
ATCCCGCAGC GCACCGCCGA CGCGATGCGC TTCGCCCGCA ACAACCCGAA GGACTTCATC
AAGGCCAAGC TCGGCATGAC CAACAAGGTC GCCAACCCCA AGCCCGACTA CGTCGAGAAG
ATCGAGGACG ACAACATCAT CCTGTCCGAG CGCGCCGAGC TCCAGATGTC GAAGATCCCC
GACCCGGCGC TCGAGCGCAA CAAGCACGTC TACGTGCTCG GCGGCTCCGG CTCGGGCAAG
ACCTTCAACT TCGTCGGCCC CAACCTTCTC CAGCTCAACA GCTCGATCGT GACGACCGAC
CCCAAGGGCG ACACGCTCAA GCAGTACGGC AACTTCTTCC TGCGCCACGG CTACAAGCTC
AAGGCTGTCA ACACCAAGCC CGACCAGATC AACCAATCGA TGCACTACAA CCCGCTGCTC
TACCTCCAGG ACTCCACGTC GATCATGCAG ATCGTCAACC TGCTGGTGGA GAACACGTCG
GGCAACGCGG AGGCCGAGAA GGAGGACTTC TTCGTCAAGG CCGAGAGGCA GCTGTACATG
GCGCTGATGG GCTACCTCTT CTACTTCTAC GCGGACCAGC CGCAGTACCA GACGTTCCCG
CAGATGCTCG ACCTGCTCCA GCTCGCCGGC AAGGACAACC CCAGCCAGAC CAAGACCCCG
CTGGACATCA TCATGCTCGG CACGACCGCC GAGGACGGCT TCCAGGGCTT CGAGGAATGG
ATCGTCGCCA ACCACGGCGG CGACGAGGCG GCCGCGCAGG CCTCCGAGGA GTACTTCGTC
ATCAAGCAGT ACAAGGGCTT CAAGTCGACC TCCGAATCGC CGGAGACCGA GGCCTCGGTC
ATCGCGTCGT GCAACGTCCG CCTGGCGCCG TTCGCCGTCT CCGCGGTCCG CGAGTTCTTC
AGCGAGGACG AGCTGGAGCT CGAGATGATC GGCGAGGAGC GCACGGCGTT CTTCCTGGTC
ATGTCCGACA CCGACAAGAC GTTCAACTTC ATCCTGGCGA TGCTGCTCTA CCAGCTGTTC
GACGTCAACA CTGCCATCGC CGACAGGAAC CCCGGCTCGC ACTGCAAGAT CCCGATCAAC
TGCATCCTCG ACGAGCTCGC CAACATCGGC CGCATCCCCG ACCTCGACGT CAAGATCGCG
ACCCTGCGCT CGCGCTGGAT CTACATCACG GCCATCCTGC AATCGGTCAC GCAGCTCAAG
AAGATGTACA AGGACAACGC CGACATCATC GAGGGCAACT GCGACACGAC GCTGTTCCTC
GGCCGCTGCG ACCTCGAGAC CAACAAGAAG ATCTCCGAGC GCCTGGGCAA GTTCACCGCC
ACCGTCCGCA ACCGCAGCGA GTCGCACGGC CGGCAGGGAT CGTGGTCCGA GAGCGAGAAC
AAGATCGGCA AGGAGCTCAT GGCCGCCGCC GACCTCGGCA ACAACCCCGA GAAGTTCGGC
GGCGACGACT GCATCGTCTT CGTGAAGAAC GCCTTCCCGT TCCTCGACAA GAAGTACAGG
ACGATCGACC ACCCGCGCTA CCACGAGCTG CGCGAGGTCG GCGAGTTCAA CCTGGACGAC
TGGAACTGGG ACCGCAAGTG CGAGCGCGAG CGCGCGCACC GCGCCGAGGT CGAGGAGATG
CGCTGGCGCA TCGAGGAGGC CCGTTCGTTC TTCGACCCCG AGTTCTTCAT GTAG
 
Protein sequence
MRTLKGFLYL ILGGTLCGWM FNRIAAWFID NPLTVGSNHT VAEWAVILQD PFYLDSRTMP 
FFFLAFGAIA LVAMTKYDWT GEREEQKKLR GEEYGNQRWA RDDEMQQFAH TSTVKRVPIR
IPQRTADAMR FARNNPKDFI KAKLGMTNKV ANPKPDYVEK IEDDNIILSE RAELQMSKIP
DPALERNKHV YVLGGSGSGK TFNFVGPNLL QLNSSIVTTD PKGDTLKQYG NFFLRHGYKL
KAVNTKPDQI NQSMHYNPLL YLQDSTSIMQ IVNLLVENTS GNAEAEKEDF FVKAERQLYM
ALMGYLFYFY ADQPQYQTFP QMLDLLQLAG KDNPSQTKTP LDIIMLGTTA EDGFQGFEEW
IVANHGGDEA AAQASEEYFV IKQYKGFKST SESPETEASV IASCNVRLAP FAVSAVREFF
SEDELELEMI GEERTAFFLV MSDTDKTFNF ILAMLLYQLF DVNTAIADRN PGSHCKIPIN
CILDELANIG RIPDLDVKIA TLRSRWIYIT AILQSVTQLK KMYKDNADII EGNCDTTLFL
GRCDLETNKK ISERLGKFTA TVRNRSESHG RQGSWSESEN KIGKELMAAA DLGNNPEKFG
GDDCIVFVKN AFPFLDKKYR TIDHPRYHEL REVGEFNLDD WNWDRKCERE RAHRAEVEEM
RWRIEEARSF FDPEFFM