Gene Elen_1510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1510 
Symbol 
ID8415808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1800238 
End bp1803306 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content66% 
IMG OID645024478 
ProductCna B domain protein 
Protein accessionYP_003181867 
Protein GI257791261 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.681138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGA AGACGATGGA GAGGATAACC TCCTCGAAAA TCGTGCGGGT CGCCATGGCG 
ATGGCGCTCG TGCTGTCCGT CGCCATCGTC CCGACCAAGG CATATGCCTC CGGGAACGTG
AACGTCTCGA TCGGCAAGAG CATCCCCTAT GCCGGATACG AGACGACCCA GATGAGCGCC
AACGGCAACG ACGCCTACTG CATCGAGCCG TCGCGCTCCA CGCCCGATGC GGGAACCTAT
CCGACGAGTG AGGCGGGAGA TCTGGCGGCG GCCATGTGGT TCTCCTATGG GGCACCCGGC
TTCGACGCGT CCATGTGGCC CAGCTCCTGG TACGACGGAA GCGGCATGGA CGAGGACAAG
TACCGCGTGG CGAGCCACAT CCTGCTCTCG TACGCCAACC TGGGATCTGC CGCCGAGGCG
ACCTACGGCA CGAGCGCCGA GTTCGCCTCC TGGGCGGAGC GCGAGATCGC GGGCGACGTG
TGGAGCCAGG TCAACGCACG TGCGAACGAG GTCTCCACGG GATTCTCGGC CATCAGGATC
CATGCGGGAT CAGACGCCCA GACGCTCGCC AGCTTCACGT GGGAGCGCGG CGGCGTGAAG
ATCGCCAAGG TCGATGCGCA AGCCGGCGCA GGCGAGCAGG GCGACGCCTC GCTCGAGGGC
GCCGAGTTCG CCATCGTCAA CGCATCGGGG ATGAACTCCT ACGTGAACGG CCACAGCTAC
GCAGACGGCG AGACGGTCAT GACCATCTCC ACCTCCTGGG ACGGCTCGGC CTACACCGCG
CAGACCGCGA GCGACGCGCT GCCGTGCGGC ACCTACCGCA TCGTTGAGGC GAACGCTCCG
GAAGGCTACC TTCCCTGGGA AGGCGAGCTC GGGTTCGCCA TCGAGGGCGA CGGCCAGGTC
GTCGACCTTT CCGGCGACCC CGTGCATGAC GACGCGATAC GCGGCGGCGT GCAGGTGACC
AAGGCCGACG CGGAGCTCGG CGAGTCCGAG GCGGTCGGCG GGAACGGGCA TGCAGCAGAA
GGCGTCGGCA CGACGCTCTC CGGCATCGAG TTCGCCATAA CGAACGCCTC CGAGAACAAG
GTCCTCGTAG GCGATGCCTG GTACGAGCCG GGCGAGGTCG TCGCCGCCAT CGAGACCGAG
TGGGACGAGG AAATCGGCTC CTACATCGCG AAGACCGCGC CCGACGCGCT GCCCTACGGC
ACCTACACCA TCCAGGAGAC CGCGACCAAC GACTCCTACC TGCTCACCGA CGGCGAGCCG
AAGACCTTCG AGATCCGGAC CGACGGCCTT GTCGTGACGG CGGACGCGGA AGGCGGCGAG
CTCACCTTCT ACAACCAGGT CGTCCGCAAC GACCTCAAGC TCTCCAAGAA GGCCGAGGAC
ACCAACGCGA GCCTGCAGGT TCCCTTCGCC ATCTCCAACG TGGCGACCGG CGAGACGCAC
GTGCTGGTCA CCGACCGCAA CGGCCAGGCT TCGACCGAGG CGAGCTGGAA CAGGCACACC
GCCAACACCA ACGGAAACGA CGTCCTGCTC GAAGCCGACC GCATCACGGC CGGTATGATG
GACCCGTCGG CGGGCATCTG GTTCGGGCTG GGCGAGGACG GCTCGTCGGC TCCCGCCAAC
GACGCGCTCG GCGCGCTGCC GTACGGCCAG TACACGCTCG AGGAGCTTCC CTGCGAGGCG
AACGAGGGCT ACGAGCTCGT GACCAAGGCC TTCTGGATCG AGCGCGACTC CGGCGTGGCG
GAGGCCGTCT GGATGACGCT CGACGACCAG GAGGGGCCGA GGATCGGAAC GCGGGCAACC
GAAGTGGCCG ACGGTGACCA GATCGCCCAG GCAAACGAGC AGACGACGAT TGTGGATACC
GTCTACTACG AGAACCTCGA GTTCGGAGGC ACGTACACCC TCACGGGCAC GCTCATGGTC
AAGTCCACCG GCGAGCCGCT GCTCGACGCC GAGGGCAATC CCGTGACCGC CACCAAGGAG
TTCACGGCGA ACAACACGAA CGGGTCGGTG GACATCGAGT TCACCTTCGA CGCGAGCCTG
CTCGCGGGCG AGGACGTCGT GGCCTTCGAG AGCCTCGTGA AGGATAGCAT CGAGGTAGCA
GTCCACGCAG ATATCGAGGA CGAGGGCCAG ACGGTCCATT TCGTCGACAT CGGCACCACG
GCGGCCGACG CCGCCGACGG CGACAAGCTC GTGACGGGCT CCGAGGTCAT CATCGCTGAC
GAGGTGGCCT TCGAGGGTCT GACCCCTGGA GGCTCTTATA CGCTCGAGGC GACGCTGATG
GATGCCGAGA CGGGCGAGCC GCTGAAAAGC GGCGAGGGGC TTCTCGCGAC TGACGTGGCC
GCGACCGTCG AGTTCACGCC CGAAGCTGCC GAGGGTACCC AGACGGTAGA GCTCTCATTC
GATTCCTCCG GCCTCGGCGG TCACCGCCTG GTGGTGTTCG AGAAGCTGCT CGACGCCGAA
GGCACCGTCC TCGCGGTGCA CGAGGATATC GAGGACGAGG GCCAGTCCGT CACGGTCGTC
GAGATCGGCA CCACGCTCGT CGACGCCGCC GACGGCGACC ACATGGTCGA GAACGGGACG
GTCACCGTCG TGGATACCGT CGAGTACAAG GGGCTCGTCG CAGGAGAAAC CTATACTGCC
CACGGCACCA TCATGGACAA GGCGACTGGC ATGCCCCTTG AGGACTCGGA AGGCAATCCG
GTGACCTCAA CCGCGGAGTT CGTGGCGGAA AGTTCTGAGG GAACCGTAGA GATCACCTTC
GAGTTCGATG CTTTCCAGCT CGAGGAGGGC GCCTCCCTCG TGGCGTTCGA GGAGGTGCTC
GACGTGAACG GGAACGTCAT CGCGGTACAT CAGGACCTCG AGGACGAGGG GCAGACCGTG
GTCGTCGACA ACCCCGAGAC TCTCGGCACC CCCTACGACA AGACGGGAGG CGACCTGCTT
CCCGTATGGG TTCTGATCAG CGCCTTGATC CTCTGCGGCG GCGCTGCGGG CGCATACGCG
CTTCGCGGCC GCATCCGTCG AAACGCATCA GTTGGCGAGG GATCCACTGA CGAGGGTCCC
GAGAAGTAG
 
Protein sequence
MKEKTMERIT SSKIVRVAMA MALVLSVAIV PTKAYASGNV NVSIGKSIPY AGYETTQMSA 
NGNDAYCIEP SRSTPDAGTY PTSEAGDLAA AMWFSYGAPG FDASMWPSSW YDGSGMDEDK
YRVASHILLS YANLGSAAEA TYGTSAEFAS WAEREIAGDV WSQVNARANE VSTGFSAIRI
HAGSDAQTLA SFTWERGGVK IAKVDAQAGA GEQGDASLEG AEFAIVNASG MNSYVNGHSY
ADGETVMTIS TSWDGSAYTA QTASDALPCG TYRIVEANAP EGYLPWEGEL GFAIEGDGQV
VDLSGDPVHD DAIRGGVQVT KADAELGESE AVGGNGHAAE GVGTTLSGIE FAITNASENK
VLVGDAWYEP GEVVAAIETE WDEEIGSYIA KTAPDALPYG TYTIQETATN DSYLLTDGEP
KTFEIRTDGL VVTADAEGGE LTFYNQVVRN DLKLSKKAED TNASLQVPFA ISNVATGETH
VLVTDRNGQA STEASWNRHT ANTNGNDVLL EADRITAGMM DPSAGIWFGL GEDGSSAPAN
DALGALPYGQ YTLEELPCEA NEGYELVTKA FWIERDSGVA EAVWMTLDDQ EGPRIGTRAT
EVADGDQIAQ ANEQTTIVDT VYYENLEFGG TYTLTGTLMV KSTGEPLLDA EGNPVTATKE
FTANNTNGSV DIEFTFDASL LAGEDVVAFE SLVKDSIEVA VHADIEDEGQ TVHFVDIGTT
AADAADGDKL VTGSEVIIAD EVAFEGLTPG GSYTLEATLM DAETGEPLKS GEGLLATDVA
ATVEFTPEAA EGTQTVELSF DSSGLGGHRL VVFEKLLDAE GTVLAVHEDI EDEGQSVTVV
EIGTTLVDAA DGDHMVENGT VTVVDTVEYK GLVAGETYTA HGTIMDKATG MPLEDSEGNP
VTSTAEFVAE SSEGTVEITF EFDAFQLEEG ASLVAFEEVL DVNGNVIAVH QDLEDEGQTV
VVDNPETLGT PYDKTGGDLL PVWVLISALI LCGGAAGAYA LRGRIRRNAS VGEGSTDEGP
EK