Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1510 |
Symbol | |
ID | 8415808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1800238 |
End bp | 1803306 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645024478 |
Product | Cna B domain protein |
Protein accession | YP_003181867 |
Protein GI | 257791261 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.681138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAGA AGACGATGGA GAGGATAACC TCCTCGAAAA TCGTGCGGGT CGCCATGGCG ATGGCGCTCG TGCTGTCCGT CGCCATCGTC CCGACCAAGG CATATGCCTC CGGGAACGTG AACGTCTCGA TCGGCAAGAG CATCCCCTAT GCCGGATACG AGACGACCCA GATGAGCGCC AACGGCAACG ACGCCTACTG CATCGAGCCG TCGCGCTCCA CGCCCGATGC GGGAACCTAT CCGACGAGTG AGGCGGGAGA TCTGGCGGCG GCCATGTGGT TCTCCTATGG GGCACCCGGC TTCGACGCGT CCATGTGGCC CAGCTCCTGG TACGACGGAA GCGGCATGGA CGAGGACAAG TACCGCGTGG CGAGCCACAT CCTGCTCTCG TACGCCAACC TGGGATCTGC CGCCGAGGCG ACCTACGGCA CGAGCGCCGA GTTCGCCTCC TGGGCGGAGC GCGAGATCGC GGGCGACGTG TGGAGCCAGG TCAACGCACG TGCGAACGAG GTCTCCACGG GATTCTCGGC CATCAGGATC CATGCGGGAT CAGACGCCCA GACGCTCGCC AGCTTCACGT GGGAGCGCGG CGGCGTGAAG ATCGCCAAGG TCGATGCGCA AGCCGGCGCA GGCGAGCAGG GCGACGCCTC GCTCGAGGGC GCCGAGTTCG CCATCGTCAA CGCATCGGGG ATGAACTCCT ACGTGAACGG CCACAGCTAC GCAGACGGCG AGACGGTCAT GACCATCTCC ACCTCCTGGG ACGGCTCGGC CTACACCGCG CAGACCGCGA GCGACGCGCT GCCGTGCGGC ACCTACCGCA TCGTTGAGGC GAACGCTCCG GAAGGCTACC TTCCCTGGGA AGGCGAGCTC GGGTTCGCCA TCGAGGGCGA CGGCCAGGTC GTCGACCTTT CCGGCGACCC CGTGCATGAC GACGCGATAC GCGGCGGCGT GCAGGTGACC AAGGCCGACG CGGAGCTCGG CGAGTCCGAG GCGGTCGGCG GGAACGGGCA TGCAGCAGAA GGCGTCGGCA CGACGCTCTC CGGCATCGAG TTCGCCATAA CGAACGCCTC CGAGAACAAG GTCCTCGTAG GCGATGCCTG GTACGAGCCG GGCGAGGTCG TCGCCGCCAT CGAGACCGAG TGGGACGAGG AAATCGGCTC CTACATCGCG AAGACCGCGC CCGACGCGCT GCCCTACGGC ACCTACACCA TCCAGGAGAC CGCGACCAAC GACTCCTACC TGCTCACCGA CGGCGAGCCG AAGACCTTCG AGATCCGGAC CGACGGCCTT GTCGTGACGG CGGACGCGGA AGGCGGCGAG CTCACCTTCT ACAACCAGGT CGTCCGCAAC GACCTCAAGC TCTCCAAGAA GGCCGAGGAC ACCAACGCGA GCCTGCAGGT TCCCTTCGCC ATCTCCAACG TGGCGACCGG CGAGACGCAC GTGCTGGTCA CCGACCGCAA CGGCCAGGCT TCGACCGAGG CGAGCTGGAA CAGGCACACC GCCAACACCA ACGGAAACGA CGTCCTGCTC GAAGCCGACC GCATCACGGC CGGTATGATG GACCCGTCGG CGGGCATCTG GTTCGGGCTG GGCGAGGACG GCTCGTCGGC TCCCGCCAAC GACGCGCTCG GCGCGCTGCC GTACGGCCAG TACACGCTCG AGGAGCTTCC CTGCGAGGCG AACGAGGGCT ACGAGCTCGT GACCAAGGCC TTCTGGATCG AGCGCGACTC CGGCGTGGCG GAGGCCGTCT GGATGACGCT CGACGACCAG GAGGGGCCGA GGATCGGAAC GCGGGCAACC GAAGTGGCCG ACGGTGACCA GATCGCCCAG GCAAACGAGC AGACGACGAT TGTGGATACC GTCTACTACG AGAACCTCGA GTTCGGAGGC ACGTACACCC TCACGGGCAC GCTCATGGTC AAGTCCACCG GCGAGCCGCT GCTCGACGCC GAGGGCAATC CCGTGACCGC CACCAAGGAG TTCACGGCGA ACAACACGAA CGGGTCGGTG GACATCGAGT TCACCTTCGA CGCGAGCCTG CTCGCGGGCG AGGACGTCGT GGCCTTCGAG AGCCTCGTGA AGGATAGCAT CGAGGTAGCA GTCCACGCAG ATATCGAGGA CGAGGGCCAG ACGGTCCATT TCGTCGACAT CGGCACCACG GCGGCCGACG CCGCCGACGG CGACAAGCTC GTGACGGGCT CCGAGGTCAT CATCGCTGAC GAGGTGGCCT TCGAGGGTCT GACCCCTGGA GGCTCTTATA CGCTCGAGGC GACGCTGATG GATGCCGAGA CGGGCGAGCC GCTGAAAAGC GGCGAGGGGC TTCTCGCGAC TGACGTGGCC GCGACCGTCG AGTTCACGCC CGAAGCTGCC GAGGGTACCC AGACGGTAGA GCTCTCATTC GATTCCTCCG GCCTCGGCGG TCACCGCCTG GTGGTGTTCG AGAAGCTGCT CGACGCCGAA GGCACCGTCC TCGCGGTGCA CGAGGATATC GAGGACGAGG GCCAGTCCGT CACGGTCGTC GAGATCGGCA CCACGCTCGT CGACGCCGCC GACGGCGACC ACATGGTCGA GAACGGGACG GTCACCGTCG TGGATACCGT CGAGTACAAG GGGCTCGTCG CAGGAGAAAC CTATACTGCC CACGGCACCA TCATGGACAA GGCGACTGGC ATGCCCCTTG AGGACTCGGA AGGCAATCCG GTGACCTCAA CCGCGGAGTT CGTGGCGGAA AGTTCTGAGG GAACCGTAGA GATCACCTTC GAGTTCGATG CTTTCCAGCT CGAGGAGGGC GCCTCCCTCG TGGCGTTCGA GGAGGTGCTC GACGTGAACG GGAACGTCAT CGCGGTACAT CAGGACCTCG AGGACGAGGG GCAGACCGTG GTCGTCGACA ACCCCGAGAC TCTCGGCACC CCCTACGACA AGACGGGAGG CGACCTGCTT CCCGTATGGG TTCTGATCAG CGCCTTGATC CTCTGCGGCG GCGCTGCGGG CGCATACGCG CTTCGCGGCC GCATCCGTCG AAACGCATCA GTTGGCGAGG GATCCACTGA CGAGGGTCCC GAGAAGTAG
|
Protein sequence | MKEKTMERIT SSKIVRVAMA MALVLSVAIV PTKAYASGNV NVSIGKSIPY AGYETTQMSA NGNDAYCIEP SRSTPDAGTY PTSEAGDLAA AMWFSYGAPG FDASMWPSSW YDGSGMDEDK YRVASHILLS YANLGSAAEA TYGTSAEFAS WAEREIAGDV WSQVNARANE VSTGFSAIRI HAGSDAQTLA SFTWERGGVK IAKVDAQAGA GEQGDASLEG AEFAIVNASG MNSYVNGHSY ADGETVMTIS TSWDGSAYTA QTASDALPCG TYRIVEANAP EGYLPWEGEL GFAIEGDGQV VDLSGDPVHD DAIRGGVQVT KADAELGESE AVGGNGHAAE GVGTTLSGIE FAITNASENK VLVGDAWYEP GEVVAAIETE WDEEIGSYIA KTAPDALPYG TYTIQETATN DSYLLTDGEP KTFEIRTDGL VVTADAEGGE LTFYNQVVRN DLKLSKKAED TNASLQVPFA ISNVATGETH VLVTDRNGQA STEASWNRHT ANTNGNDVLL EADRITAGMM DPSAGIWFGL GEDGSSAPAN DALGALPYGQ YTLEELPCEA NEGYELVTKA FWIERDSGVA EAVWMTLDDQ EGPRIGTRAT EVADGDQIAQ ANEQTTIVDT VYYENLEFGG TYTLTGTLMV KSTGEPLLDA EGNPVTATKE FTANNTNGSV DIEFTFDASL LAGEDVVAFE SLVKDSIEVA VHADIEDEGQ TVHFVDIGTT AADAADGDKL VTGSEVIIAD EVAFEGLTPG GSYTLEATLM DAETGEPLKS GEGLLATDVA ATVEFTPEAA EGTQTVELSF DSSGLGGHRL VVFEKLLDAE GTVLAVHEDI EDEGQSVTVV EIGTTLVDAA DGDHMVENGT VTVVDTVEYK GLVAGETYTA HGTIMDKATG MPLEDSEGNP VTSTAEFVAE SSEGTVEITF EFDAFQLEEG ASLVAFEEVL DVNGNVIAVH QDLEDEGQTV VVDNPETLGT PYDKTGGDLL PVWVLISALI LCGGAAGAYA LRGRIRRNAS VGEGSTDEGP EK
|
| |