Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0796 |
Symbol | |
ID | 8415086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 990757 |
End bp | 994527 |
Gene Length | 3771 bp |
Protein Length | 1256 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645023762 |
Product | Cna B domain protein |
Protein accession | YP_003181159 |
Protein GI | 257790553 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAAA GCAAGGAAAC AGCCCGAGGG GCGAGCCTCG AAGATGTGCG CGGGAAGCTC GCTCGCTGCC TGCTCGCCCT CTTGCTCGCA GCGTCGAGCG TTGCTGGTGC GCTATCATCG CTCGGCGCGA AGGAGGCGCA CGCGGCGGAA ACCGCTTACC TGAGCGTGGG CGGCAACATC CCCTATGCGG GGTTCTTCAC CACGTGGATG TGGGCAGACG ACCAGATGGC CTACTGCGCC CAGCCCTCCA AGGCGACGCC TTCCGAGGGC GGCTACGAGA AGGCGCCTTT GAGCACCGCG TCCGGCCGCG ACGCCGAAGC CGCCGCCGAC CTGTGGTTCG GATGGGGCGG CCCGGGCTTC GACTACTCCA TGTGGCCCGG CGCCTGGTAC GACGGCACGC CGATGAACGA CGCGCGCTAC GCAGCGCTCA CGCACATCAT CCTGTCCGAC ACGTACAGCT CGAGCGGCGA CCAGGCCATG CATGGGTGCA CCCAGGCGTT TCGCTCGTGG TGCCAGCAGA ACGTGCTCGG CTTCGACGAC GGCGGCTCGG TGATCAACGG CGACGCCACG GGACGCCTCA TGCACTCGCG CATGGGCGAG GTCAAGCAGT CCTTCAAGGC GTTCCAGCTC TACACGGGAA GCTTCTCGCA GATGATCGTC TCGTTCAGCT ACACTCCCTA CGGCAGCGTC GAGCTTGAGA AGCAGTCGGG CAACGATGCC ATGAGCAACG GCAACGACGC GTACTCCCTT GCGGCCGAGT ACACGCTCTA TTCGGACGAG GCCTGCACGC AGGCGGTGTC CGTCATGGCT TTAGACGACT CCGGCCGCGG CAGGATCGAC GAGGTTGAGC CCGGCGACTA CTGGCTCAAG GAGTCTAAGG CCGCACCTGG GTTCGCCATC GACGAGGCCG CCTACGCGGT GACGGTCGAG CCCGACAAAA CCGCCACCGT TGCGGCCGGG CACGTTCAGG ACACGCCGCA GTCGAACGCC GTCGACGTCG TCGTGGCCAA GGCAGACGCC ACCACTGAAA AGGCACAGCC CCAGGGCGAT TCCGCTCTCA GCGGCGCCGA GTTCACCGTC GAGTACTACA AGGGGATCTA CTCCTCGGCC GACGAGGCCC GCGCATCGGG CTCGCCTGAG CGCACCTGGG TCCTTGCGAC CGACGACGAG GGCAAGGCCC ACCTGAGCAG CGATTCCAAG GTGTCCGGCG ATGCGTTCTA CTACGCATCC GACGGGGCGA CGCCGGTCAT CCCGCTCGGC ACCGTCCTCG TGACCGAAAC GAAGGTGCCC TGCGGCTACA ACCTCGACGA CGGCCGGGGC AACGCGCCCG AAACGCATCT TGCGAAGGTT GAGTCGGACA ACGATCGCAT CGAAGCCGTT TCGACCTACA ACTCGCCTGT GGCGAAGGAC ACCGTGAAAC GCGGCGACTA CCGCCTCGTC AAGGAGGTGC CGGTATCGAT CTACTCCGAG GGCGCAGGCG ACATGCCGCA AGACGCCAAG CGCGTGCTCG TCCCCGGCGT CGAGTTCCAG CTGATCAACG ACTCGACCAA CCCGGTCACA TCCCCTGAAA CCGGCGAGGA GGTCGCGCCC GGCGGCGTCG TGTGCACCAT AACGACCGAC GAGAACGGGC TAGCCACCAC CAAGGACGAG AACGCCGCCG CCAACGGATG GTCGAAGCCC GAGGGCTGGA GCGCCGCGCT GGCGTACGGC ACCTACACCG TGCACGAGGT CATCCCTGCA GACGTCGCCG CGAAGTTCAA GACGGAATAC GGCAAGGCCC TTCTTCCCGT CGAGGACTTC AAGATCACGA TTTCCGACGA AGGACAGTAC GACCCGCCGG TTCTGGTGAG CAACAAGATC CCGCAAACGC CGCTCAAGGT GGTCAAGGCC GACGCCGAAA CCGGCAAGCG GATCCCGCTT GCGGCGAGCT TCGAACTGTA CGACGCCTAC GGTGGTCTCG TCACCTATAC CCTCCACTAC CCGGACGAGG AGGTCGTGAG CACGTGGACC ACCAACGAGC GCGGCGAGCT CACCTTGCCG ATGGCTTTGG GGGAGGGGTG TTACTTTTTG AAGGAGGTCG CAGCGCCCGA GGGCTACGTC CTTGATTCCG AGCCGGCAGC CTTTGCGGTT TCCGCCGACT ACCGCGGATG GGACGACCCG ATCGTCCTGA CTTTCGAGGA CGCGCCCCAG AAAGGCACCG TAACCGTGTC GAAGACCAAC TCCGAGACGG GGTCGCCCGT CGACGGCTCC ACCTATATCG TGAAGGCCGA AGGCGATGTC TCCACCCCCG ACGGGATCCT GCGCTACGCA GACGGGCAGA TCGTAGCGAC GCTCACCACC GACGCTGAGG GACACGCGAC TAGCGAACCG CTCTACCTGG GCACCTACAC CGTCTACGAG GCGAAGGCCA AAGACGGCTA CGCCCTCGAC GTGGCCGAGA ATACCGTTGC CTTGACCTAC CAGGGCCAGG AGGCCTCGGT GTTCGACGAG CGAATCGACG TGGCAGACGC CCCGACCGAG TTGCGCCTGG TCAAGGTGGA TTCGCTCGAC GGTGAAACGC CTATCGAGGG CGCCGTTTTC CGCATCTGGA ACGACGCGGG CGATTTCGAT GAGACCTTGA CAACCGACGA AAGCGGCGTC ATCGACCTCA AGTACCTGAA GCACGGCAGC TACCACCTGC AGGAGGTAGC CGCGGCGGAG GGCTACGTGA TCAGCGACGT CGACGAAGAC GGCAACGCCA AAACCCACGA TTTCGAGGTG AACGACCAGG GCATGGCCAC GCTCGACGGC GATTCCATGC AAGCCAAGCT CGCCATCGTG GTCGAGAACA TGCCCAAGAC CATGGGCACC ACCGCGGCCG ACGGCGACGG CGGCACCCAT GAGGGCCAGG CGCGCAGCGA CATGTCGATT ATCGACAGCG TCGCCTACAC GGGATGCATC CCCGGCGAAG CCTACAAAGT AACTGGCAAG CTCATGGACA AATCCACCGG CCAGCCCGCG CTCGACGCCG AGGGCAACGA GATCACGGCC GAGAAGGATT TCGTAGCGGA GGGTTTCGAG GGGTCGATTG ACATCGAGTT CAGATTCGAC GGATCCGGCC TCGCAGGCGC CTCGCTCGTT GCGTTCGAGG CGATGTACGA CGCCGAAGGC AGCATCTATA TGAATCATGA GGATATCGAC GATGAGGGCC AGACCGTCAA CGTCGTCGAC ATCGCAACAA AGGCCCACGA TGCGGAAACC GGCACAAACC AGGGCACCGT GAGCGAATCC GCGACGCTCG TGGACGTGGT GAGCTTCGAG GGGCTGACCC CCGGCAACCG CTACAAGCTG TTCACCATGC TGGTGGACAA GGCGACCGGC GAGCCCGTTG AGGATGCCGC CGGCAACCCG ATGGTGATCG AGACGGATTT CGCGCCCGAG GCGCCCGACG GCACCGTTGA AGTCGTGTTC GAGCTCGACA CCGCTGACTT GGCCGGCAAG TCCCTCGTGT TCTTCGAGAA GCTCGCCGAC GACGGCGACA ACGTCATCGC GAAGCACGAG GACATCGGGG ACGAGGGCCA GACCATCGAG CTTCCGGAGC CTGATGCTCC CCAAAACCCC GTAGGCAAGG GATACCCCAA GACGGGCGCC GATGCGGCCA AGGCGGCCGT TGCGGCAAGC GCGGCGGTGA TCGTGGGCTG CGGTGCGGCC GGCGCGGCCT ATGCCGCGGC GAAGCGCCGC AAGAAGGCCG GCGAAGGGGT TGAGGAACCG ACCGCCGAAC CTGTCGAGTA G
|
Protein sequence | MDESKETARG ASLEDVRGKL ARCLLALLLA ASSVAGALSS LGAKEAHAAE TAYLSVGGNI PYAGFFTTWM WADDQMAYCA QPSKATPSEG GYEKAPLSTA SGRDAEAAAD LWFGWGGPGF DYSMWPGAWY DGTPMNDARY AALTHIILSD TYSSSGDQAM HGCTQAFRSW CQQNVLGFDD GGSVINGDAT GRLMHSRMGE VKQSFKAFQL YTGSFSQMIV SFSYTPYGSV ELEKQSGNDA MSNGNDAYSL AAEYTLYSDE ACTQAVSVMA LDDSGRGRID EVEPGDYWLK ESKAAPGFAI DEAAYAVTVE PDKTATVAAG HVQDTPQSNA VDVVVAKADA TTEKAQPQGD SALSGAEFTV EYYKGIYSSA DEARASGSPE RTWVLATDDE GKAHLSSDSK VSGDAFYYAS DGATPVIPLG TVLVTETKVP CGYNLDDGRG NAPETHLAKV ESDNDRIEAV STYNSPVAKD TVKRGDYRLV KEVPVSIYSE GAGDMPQDAK RVLVPGVEFQ LINDSTNPVT SPETGEEVAP GGVVCTITTD ENGLATTKDE NAAANGWSKP EGWSAALAYG TYTVHEVIPA DVAAKFKTEY GKALLPVEDF KITISDEGQY DPPVLVSNKI PQTPLKVVKA DAETGKRIPL AASFELYDAY GGLVTYTLHY PDEEVVSTWT TNERGELTLP MALGEGCYFL KEVAAPEGYV LDSEPAAFAV SADYRGWDDP IVLTFEDAPQ KGTVTVSKTN SETGSPVDGS TYIVKAEGDV STPDGILRYA DGQIVATLTT DAEGHATSEP LYLGTYTVYE AKAKDGYALD VAENTVALTY QGQEASVFDE RIDVADAPTE LRLVKVDSLD GETPIEGAVF RIWNDAGDFD ETLTTDESGV IDLKYLKHGS YHLQEVAAAE GYVISDVDED GNAKTHDFEV NDQGMATLDG DSMQAKLAIV VENMPKTMGT TAADGDGGTH EGQARSDMSI IDSVAYTGCI PGEAYKVTGK LMDKSTGQPA LDAEGNEITA EKDFVAEGFE GSIDIEFRFD GSGLAGASLV AFEAMYDAEG SIYMNHEDID DEGQTVNVVD IATKAHDAET GTNQGTVSES ATLVDVVSFE GLTPGNRYKL FTMLVDKATG EPVEDAAGNP MVIETDFAPE APDGTVEVVF ELDTADLAGK SLVFFEKLAD DGDNVIAKHE DIGDEGQTIE LPEPDAPQNP VGKGYPKTGA DAAKAAVAAS AAVIVGCGAA GAAYAAAKRR KKAGEGVEEP TAEPVE
|
| |