Gene Elen_1026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1026 
Symbol 
ID8415316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1243020 
End bp1245071 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content66% 
IMG OID645023990 
Producthypothetical protein 
Protein accessionYP_003181387 
Protein GI257790781 
COG category 
COG ID 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0776826 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.319246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGAAC AAACGATCAT CCCTGCGGGG GTCATGCGGC GGGCGTGGGC CCTCCTGCTG 
GCCGCCGCGC TCTGCCTGGG CCTCATGCCC AGCGCAGCCT GGGCCGAAGA AAGCGAGGGT
GCGGGGACGT TCTCGGTGGC GCTCACCATC GTGGACACGT CCGACCCCGC GGACGGCGTC
CTGTACAACG GCAAGGTCGA CGGCATGACC TCCGACGACA CGGTTGCCGA CCTGCTGGCG
AAGGCGGGCT TCACCGCCGC GGCCAGCGCG GAGGAGACCG AGGGGAACGA CAAGGCGTAC
TTCGACTCCT GGGGCTCCCC GACGTTCCGC GGCAACAAGT CGGTCCAGCA GCCCGACGGC
TCGTGGGCCT ATTGGGCGAC GATGTTCGAC GGCGACAGCG CGAACTACGC CAGCGCCCAG
CTGACGTCGA AGCTGCAGGA GAACGGCCGC TACCAGTACA TCTACACGTC CGACGCGACG
TTCGCCTACG ACGAGGAGGC TTCCGGATTT CCGGTGCAGC TCACCATCGT GGACACGTCC
GACCCCGCGG ACGGCGTCCT GTACAACGGC AAGGTCGACG GCATGACCTC CGACGACACG
GTTGCCGACC TGCTGGCGAA GGCGGGCTTC ACCGCCGCGG CCAGCGCGGA GGAGACCGAG
GGGAACGACA AGGCGTACTT CGACTCCTGG GGCTCCCCGA CGTTCCGCGG CAACAAGTCG
GTCCAGCAGC CCGACGGCTC GTGGGCCTAT TGGGTGACGA TGTTCGACGG CGACAGCGCG
AACTACGCCA GCGCCCAGCT GACGTCGAAG CTGCAGGAGA ACGGCCGCTA CCAGTACATC
TACACGTCCG ACGCGACGTT CGCCTACGAC GAGGAGATTC CTCAGCTCGC GATCTACACC
GTCAACGACC CCTTGGCGGG GGCGATCAAG CCCGATCCCA AGCCCGATCC CAAGCCCGAA
CCCGAACCTG ATGAGCCTGC CAACGATGCC ATCGCAGTAG ACAGCGCTGC GTACAACACG
CTGTTCGGCA CCATCGCCAG TTCCTATGCG GGTACGTCCG AGGAATGGAA AGCCCTTGAG
CTCGCGGCCG CCGGTCGTGT GTCGTCGGTT GACGTGGCGA CGCTCGTGGC GAATGCGAAA
GCGGCGAACG GTTCTCCCGA AACCACCAAT TTGCAACGCT TCATCTTGGC GCTCACGGCC
GTCGGCAAAA CAGAGGAAGC AGCGGAGCTC GTGCAAACGA TGGCGACGTC CGACATTTCG
ACTACCTATG TGAACGGTCA GGCGTTTGCT CTGCTCTCCT ACGAAAGCGG TGCGTACGAC
GCGCCCGCGA ACGCTCTTGA AACCGAAGCT GAGCTTGTGG CCAAGCTTCT GAGCGCGCAG
CAGGCTTCCG GCGGCTGGAC CTGGAAAGGC GCTGCCGAAG GGGACGATCC TGATACGACG
GCCATGGTGA TAACCGCTCT TGCCTCTCGC GTTTCGGACG CCTCCGTCAA AGCGGCGGTC
GACAAGGGTC TCGAGGCGCT GCGCGCAATG CAGCACGAGG ACGGCGGTTT CCGCGCATCC
GGGGACGCGG CCGATGGTCC CATCAACGTC AGCTCTACGT CCTGCGTCGT GGTCGCCCTG
TGCGCCTTGG GCGCGGATCC GGCGGCATCC ATGGTCACCG AAAGCGGCGC GACGCCGTTG
AGCGCGCTGC TCTCGCAGGC CACTTCCGAC TTGTCCGGAT TCGTTTACAA CGGCGCTGCG
AACGACCTCG CCACCGAGCA GGGGTTCCGG GCGCTTGTTG CGTACCAGGG CCTCAAGAAC
ACCGGGGCGG CGTACAACGT CTACACGCAG GCGAAGCTCG GCCAGGCTGC GCTGCCCGCC
GAGAAGCAGG AAGAAAGCGA CGTCAAGCCC GCAGGGGCTC CGGCGGCCGA CAAGAAGGCG
CTCGCCAAGA CCGGGGACGG CTCTGCGCCG TTCGCGGCCG GCACTGCCGC GCTCGCGCTC
GGCGCGCTCG CGGCGGGCAT CGCCGCCACG CGGCGCATGC GCGCTTCCGA TGAGCTCTCG
TTGCGCCGAT AG
 
Protein sequence
MKEQTIIPAG VMRRAWALLL AAALCLGLMP SAAWAEESEG AGTFSVALTI VDTSDPADGV 
LYNGKVDGMT SDDTVADLLA KAGFTAAASA EETEGNDKAY FDSWGSPTFR GNKSVQQPDG
SWAYWATMFD GDSANYASAQ LTSKLQENGR YQYIYTSDAT FAYDEEASGF PVQLTIVDTS
DPADGVLYNG KVDGMTSDDT VADLLAKAGF TAAASAEETE GNDKAYFDSW GSPTFRGNKS
VQQPDGSWAY WVTMFDGDSA NYASAQLTSK LQENGRYQYI YTSDATFAYD EEIPQLAIYT
VNDPLAGAIK PDPKPDPKPE PEPDEPANDA IAVDSAAYNT LFGTIASSYA GTSEEWKALE
LAAAGRVSSV DVATLVANAK AANGSPETTN LQRFILALTA VGKTEEAAEL VQTMATSDIS
TTYVNGQAFA LLSYESGAYD APANALETEA ELVAKLLSAQ QASGGWTWKG AAEGDDPDTT
AMVITALASR VSDASVKAAV DKGLEALRAM QHEDGGFRAS GDAADGPINV SSTSCVVVAL
CALGADPAAS MVTESGATPL SALLSQATSD LSGFVYNGAA NDLATEQGFR ALVAYQGLKN
TGAAYNVYTQ AKLGQAALPA EKQEESDVKP AGAPAADKKA LAKTGDGSAP FAAGTAALAL
GALAAGIAAT RRMRASDELS LRR