Gene Information |
Locus tag | Elen_1600 |
Symbol | nfrB |
ID | 8415899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1898193 |
End bp | 1900328 |
Gene Length | 2136 bp |
Protein Length | 711 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645024569 |
Product | bacteriophage N4 adsorption protein B |
Protein accession | YP_003181957 |
Protein GI | 257791351 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
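The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced here for illustration; they are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Elen_1600.
length_bp = gene_length(1898193, 1900328)
print(length_bp)                           # 2136
print(expected_protein_length(length_bp))  # 711
```

Both results agree with the Gene Length and Protein Length fields listed in the record.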
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.443209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00000043363 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACGAGC AGGTCGTCTA TTGGATCGGC TTCTTCGTTG CGCTCGCGTT CATCGTGTTC GGCGCGGACG ACGTGCTGTG GGACGTGTTC GCGCTGTTTC GCGGCACGGG CAAGAAGCGC GTGAAGCTGT CGCTCATCAA CGAGAAGCCG CCGAAGATGC TGGCCGTGGT CATCGCCGCC TGGCACGAGG ACGCCGTGCT GGGCGAAGTG GTGGACAACC TCGTGGCCTC CGCGCAGTAC CCGCGCTCGC TGTACCGGGT GTTTTTGGGC GTCTACCCCA ACGATGCGGC TACCGTGGCC GTGGCTCGCG CGCTCGAGGT GCGCCATGGA GGCACCGTGG TGTGCGTGGT GGGAGACGAT CCCGGACCCA CGTCGAAGGC CGCCAACATC AACCATACCG TGCGGGCCAT CCGGGAATAC GAGGCCGAGC GCGACGTGCG CTTCGCGAGC GTCACCATCC ATGACGCGGA GGACGTGGTG CACCCCAACG AGTTCAAGAT GACGAACTAC TTGATCGACG ACTACGACGC CCTCCAGTTC CCCGTGTTCC CGCTGCAGCG CATGCCGCGG CTGCGGCTGT TCTTCAAGAC GTTGACCTCG TCCACCTACG CCGACGAGTT CGCCGAGCAT CACTTCCGCA CCATGGTCAT GCGCGACGAG CTGGGCTTCG TTCCTTCGGC GGGCACGGGC TTCGCCATCG GCCGGCGCGT CCTCGACGCG TTTCGCGACG AGGACCTGCT GCCGCGCAAC AGCCTGACCG AGGATTACAA GCTGTCGCTC ACCTTGCGCA TGCGCGGCTT CCGCGTGCAC TACGTGCTTG AGAAGGTGCC GCGCGTCGAC GCGCGCGGGC GCACCGTGTG GGACTACATC GCCACGCGCT CGCTGTTCCC GTCGACGTTC AAGGCGGCGG TGCGGCAGAA AGCGCGATGG GTGTACGGCA TCACGATGCA GAGCGCGAGC ATGGCCGATG TGTTCGGCAA AAGCGAGCTT ACGTTCGCCG AGCGCACGTT TCTGTACAAG GGCCTCAAGG CGAAGTTCGC GAACTTCGTG CTGCTGCCGG GCTACGCCGT GCTCGCGTAC TTCCTCGTGC AGACGTTCGC GCCGCAGCTG GAGCTGCCCG TCATGTACCC CTTGCACAGC CCTTCGTGGT GGATGTGCGT GTTTCTGCTG TTCATGATGG TGGAACGGCA GGTGCTTCGC GGACGCGCGC TGGCGAACGT GTACGGCTGG AAGACGATGG CGTTCTCCAT CCTGCTGCCG CCGCTGTTCC CGCTCAGGCT TCTATGGGGC AACCTCATCA ACATGTGCGC GACGTTTCGC GCATGGCGGC AGAAGATCGC CTACGTGCTG CTGCGCGGAC GAGAGGCCAA GGCGGCCGCC GCGCCGGTCG TCGAGCATCG GGGCAACGCG GCAGAGGAGG AGGGCGAAAG AAAGCCTGCA ACGGACGGAG ACGAGGCGCA AACCTCGAAT GCGACGTCTG CGCAGGAAGG TCCCGCATGG AACAAGACCG ACCACGAATT TCTCCCCGCT TCGGTGCTCG AACGCTATCG GCGCCTTCTG GGCGACGCGC TGCTGGAGAG GGGTTTCGTG GAGCCAGGGC ATCTGGAAGA CGCCGTGGGA TCGGCGCGTG CGCGCGGCGT GCGGTTGGGG CAGGAGCTGC TGAGGCAGGG ATTGGTCGAA GAGAGGCACC TCACGCAGGC TTACGCCTTG CAGCAGCAGT CGATGTACGT GCGTGCACAG CCCGACCTCG TGCTTCTGGA GCTGATGGAT CGCATGCCGT TCGCCGCGGC GGATCGGTTC 
GCCGCGCTGC CGCTGGTCGA GAGCGAAAAA GGATGGATCG TCGCCGTGGA CGACGACCTT TCTTGCGCGG AGCGAGACGA ACTGGCGTTT CTGCTGGGCG AACCGACGTT CTTCCTGTTC TCCAGCACAG CCGACCTGCT CGAAGCGTTC GAAGGCGCTC TCGCGTTCGA CAACGCGGCG GAAGCTCCGC AACCTGCCGG GGCGGCGACG CTTCTGGAGG AGACGAGCGT AGAGCTGCCA CAGGCGGGCA TGGCGCTAGC TTACGCGCTG CATCTCGGCC GCTCGGTCGA CGACATCGCT TGCGAGATGG GCCTCGCCGT GTCGCGTTTC TCCTAG
|
Protein sequence | MDEQVVYWIG FFVALAFIVF GADDVLWDVF ALFRGTGKKR VKLSLINEKP PKMLAVVIAA WHEDAVLGEV VDNLVASAQY PRSLYRVFLG VYPNDAATVA VARALEVRHG GTVVCVVGDD PGPTSKAANI NHTVRAIREY EAERDVRFAS VTIHDAEDVV HPNEFKMTNY LIDDYDALQF PVFPLQRMPR LRLFFKTLTS STYADEFAEH HFRTMVMRDE LGFVPSAGTG FAIGRRVLDA FRDEDLLPRN SLTEDYKLSL TLRMRGFRVH YVLEKVPRVD ARGRTVWDYI ATRSLFPSTF KAAVRQKARW VYGITMQSAS MADVFGKSEL TFAERTFLYK GLKAKFANFV LLPGYAVLAY FLVQTFAPQL ELPVMYPLHS PSWWMCVFLL FMMVERQVLR GRALANVYGW KTMAFSILLP PLFPLRLLWG NLINMCATFR AWRQKIAYVL LRGREAKAAA APVVEHRGNA AEEEGERKPA TDGDEAQTSN ATSAQEGPAW NKTDHEFLPA SVLERYRRLL GDALLERGFV EPGHLEDAVG SARARGVRLG QELLRQGLVE ERHLTQAYAL QQQSMYVRAQ PDLVLLELMD RMPFAAADRF AALPLVESEK GWIVAVDDDL SCAERDELAF LLGEPTFFLF SSTADLLEAF EGALAFDNAA EAPQPAGAAT LLEETSVELP QAGMALAYAL HLGRSVDDIA CEMGLAVSRF S
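The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two tables differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE`, `clean`, `translate` and `gc_percent` are hypothetical helpers written for this example, and only the first 30 bases of the gene are used to keep it short.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGGACGAGC AGGTCGTCTA TTGGATCGGC"
print(translate(first_blocks))  # MDEQVVYWIG
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the Protein Length and GC content fields to be checked against the sequence itself.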
|