Gene Elen_0745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0745 
Symbol 
ID8415035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp936623 
End bp939538 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content51% 
IMG OID645023716 
Productpentapeptide repeat protein 
Protein accessionYP_003181113 
Protein GI257790507 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.888113 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGTG AAGCGGTTAC CGTTTCTTCG GTTGTTTCTG CATTCATCTC TGAGTTCGTC 
GCCAAGACAA CGCAATGGGC TGAACAAGGA CGCAAAAAGG CCAAAACAAA GGAATCGCTT
TCAGGTGCGC TCGACAACCT GACCTCAGAT CTGAGGTCAG CGCTTGAGCA AGCAGCCGAT
GTAGGCCTCG ATGAAAACGG CTTGTCGTTT TGCGACTGGC TCGGCGAAGA GACCGAGTAT
ATCTACGACT CGGCACTGGA GACGAAAGAG GAAAGGGATT CGCGAATCGA AGCGCTCCTC
CTTGAAGGTC AAAGAAAATT CGGTTTCAAA GGCGGCGAGG ACAAGGCGGA ATCCATAGCC
GAAGCGATAA GCATCGCATT CGATTACTGT CTCAAACTGA GATTTGGGCA TCTAAACGAC
GACGACAGAA CCGTCGTCAA CATGACGGTA ACATACGGAG AAGAGACGGT CGACCGGCAA
ACGACTGAGA TAATCTACGC CCTAGATCCA ACACTCGCGA CCGATCGTGA GTGGCTGAAG
TACCAGGATT TCCTTGCTTC CCAATCCATG ACGGCGATAG GCACCAAATT CACCGTCGAC
GAGCTCTACA TACCGCTCAA CGCCATCGAG ACTGCGGAGG CCACGTTTTA TCCTCCACAC
CAAGATTCCC ATGGCCGCCT CTTCGATCTC ATCGAGAGTA ATAATCGTTT AGAAATCCAG
TTGTTTCCCG AAACCGTTGC AGAGGCATCC GTTGCCCATT GGCCTATCGA GCATAGCCAT
ACCTCAACTT TCACCGATAC AGTAAGCTTA GTCGTTTCAG TAGATGAGCG TATCGCGCAG
TGGCTGGAAA CGCCCGATTT TAGGGACGAC CACAACCCTG TCCTTGTTCT TTCCGGAGAT
CCCGGAAGCG GAAAATCCAC CGTTGCCAGG CGCCTTTCTA AAACACTGGC CAAAACCGAA
GAGGTCAACG TTGCTTACAT CAGCCTCAAA GACGTGGCGA CGGATTCGCA AATCGACGAT
ATCGAGGCCC TCGTCGTGAA ATACATCCTA TCGTTGCCCG GTCGCGACTT CGTTGTTTCG
AAGCTTCCAA CGAGCAAGCC TCTGGTTCTC ATATTCGATG GACTCGATGA ATATGCAGCG
CGAGGACCTA AGAGCAAAAA GGCAGCATGG GGTCTCCTGG GATCCATACT TCGATACGGA
CAGCGTTGCA GCGAAGCCTC CTTCCCAACC AGAGTCCTCG TCACTTCAAG GACCACACTG
CTTCGAGATA TGAAAAAAGA AGTATCTCCA AAAGAGTGCA AGCCGATTCG ACTCGAGCTG
CTTCCCTACG TTTGCTCCGA AAGTGAGTTA GGCGACATTT GCGACCCAGA CTTCCTCCTT
TGTGAAGATA AAAGGATTAC TTGGTGGACG AACTACGGAG AGAGACTTGG TAGGGATATG
AGCTCGACGA TGGAAAAGAT TTTCAAGTGG GACGACGAAG TGAAGTTATC GGCCCAGCCT
ATCCTCAACC ATCTGATAGC CGTGTTCGCA AAGAATATTC TCGAGACGGC TGCACCCAAC
CGCGCCGAGG TGTACGGAAC TTTGCTTAAG GGCGTGATAA ACCGGAATTA CGATCACGCG
CAAGGAAAGG CGAAAATACA TAGCAGAACA CGCGCAGCGG ACGTGGCGGG GTATCTCGAG
TTCATGGAGG CCGCTGCTAT TATGGCATGG CACAACGGCG GATCTATCAC GGATTTAAAG
CCCTTGAAAA ACGAATGCGG GAGCGAACGC GCTAAAGAAG CCTTTGATTG CTTCAAAAGC
GAGGCGCGCA TCAAGGATAA TATAATCGCT GGTTACTTTC ACCTAAGAAA CAAAAACGGC
GAGGAGAGTT ACGAGTTCGT TCATTACAGT TTCATGGAGT ACCTCGTATC GCGCCGCATC
GTCGGCGAGT TGACCAAAAT GCTCGACAGG AAAGTTTGCC CCATCACATC GATGCCAAAG
CTTTACGACA TGCTGGGCTG TTCGGAACTC ACGGACAACA CGCTATCGTT CATAAGGGCC
GAACTCTCTC TGTTCACCAA AAAGAAAGCC GCCTCGCTTC AAAAATACAT GATGTCGTTA
TTCATGGAAT CCCTATTGGA ATACAGCTTC GATGAGACGT TTTTTGATAA GGAAAGAAAC
GGAACACTGT TTTCGACAAA GTTATCATCC ATTCGCAACG TCCAGGGGAA CATTCTGGCG
CTACACAGCT GCGCTGCCAA AGTGACTGGC GAGAGGATGG GCGTATCTTT CAACCTATTG
TTACAATGGT TGGGAGGGAT CGGTGTGTTT GCATGGGAAA GCAACGTTTC TTCTTTCTTC
AATGGTCTTG AGCCGATCAT AGAGAGCGGG GAAGATGAAT ACATTCTGAC GCTGCCTTTC
GCGCAACTCA GTTCGTCGGA TATGTCCAAT TCGTTCATGG AACACGCAAT ATTCACCGGA
GCAATGCTTG ATAACACCAT CGTAGTCAAC GCGGATATGA AGCATGCGAA TTTCAATGAC
GCCCGACTCG TTGACGCGAA ATGCTCATAT GCCCATTTCG AACACGCGTC CCTGGAAAAA
GCCACTCTTC ATGGAGCGCA CTTCGATCAC GCCCACCTCG AGAACGCCCA CTTGCTAGGA
GCAGAGTTGG AGGGGGCGAA ATTCCAGCAC GCCCACCTCG AAGATGCGGA TTTGCGGTTC
GCAAAGCTCA TAGAGGCAGA ATTTGGATGG GCGAAAATGA GGGGCTCAAA TATGGGAGGG
GCCATCCTCC GCGAGGCTGA TCTTCGGTAT GCCGAACTCA AGGGATGCAA TCTCGAAAAC
GCGATCCTCG ACAAGGCGAA GGTGCTGAAG AAGGACATCA AAACACTCCA CGAATGCGGT
GCGGACGTTT CAAAAGTTAT AGCGTACGAC GAATGA
 
Protein sequence
MFGEAVTVSS VVSAFISEFV AKTTQWAEQG RKKAKTKESL SGALDNLTSD LRSALEQAAD 
VGLDENGLSF CDWLGEETEY IYDSALETKE ERDSRIEALL LEGQRKFGFK GGEDKAESIA
EAISIAFDYC LKLRFGHLND DDRTVVNMTV TYGEETVDRQ TTEIIYALDP TLATDREWLK
YQDFLASQSM TAIGTKFTVD ELYIPLNAIE TAEATFYPPH QDSHGRLFDL IESNNRLEIQ
LFPETVAEAS VAHWPIEHSH TSTFTDTVSL VVSVDERIAQ WLETPDFRDD HNPVLVLSGD
PGSGKSTVAR RLSKTLAKTE EVNVAYISLK DVATDSQIDD IEALVVKYIL SLPGRDFVVS
KLPTSKPLVL IFDGLDEYAA RGPKSKKAAW GLLGSILRYG QRCSEASFPT RVLVTSRTTL
LRDMKKEVSP KECKPIRLEL LPYVCSESEL GDICDPDFLL CEDKRITWWT NYGERLGRDM
SSTMEKIFKW DDEVKLSAQP ILNHLIAVFA KNILETAAPN RAEVYGTLLK GVINRNYDHA
QGKAKIHSRT RAADVAGYLE FMEAAAIMAW HNGGSITDLK PLKNECGSER AKEAFDCFKS
EARIKDNIIA GYFHLRNKNG EESYEFVHYS FMEYLVSRRI VGELTKMLDR KVCPITSMPK
LYDMLGCSEL TDNTLSFIRA ELSLFTKKKA ASLQKYMMSL FMESLLEYSF DETFFDKERN
GTLFSTKLSS IRNVQGNILA LHSCAAKVTG ERMGVSFNLL LQWLGGIGVF AWESNVSSFF
NGLEPIIESG EDEYILTLPF AQLSSSDMSN SFMEHAIFTG AMLDNTIVVN ADMKHANFND
ARLVDAKCSY AHFEHASLEK ATLHGAHFDH AHLENAHLLG AELEGAKFQH AHLEDADLRF
AKLIEAEFGW AKMRGSNMGG AILREADLRY AELKGCNLEN AILDKAKVLK KDIKTLHECG
ADVSKVIAYD E