Gene Elen_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1600 
SymbolnfrB 
ID8415899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1898193 
End bp1900328 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content65% 
IMG OID645024569 
Productbacteriophage N4 adsorption protein B 
Protein accessionYP_003181957 
Protein GI257791351 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.443209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000043363 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGAGC AGGTCGTCTA TTGGATCGGC TTCTTCGTTG CGCTCGCGTT CATCGTGTTC 
GGCGCGGACG ACGTGCTGTG GGACGTGTTC GCGCTGTTTC GCGGCACGGG CAAGAAGCGC
GTGAAGCTGT CGCTCATCAA CGAGAAGCCG CCGAAGATGC TGGCCGTGGT CATCGCCGCC
TGGCACGAGG ACGCCGTGCT GGGCGAAGTG GTGGACAACC TCGTGGCCTC CGCGCAGTAC
CCGCGCTCGC TGTACCGGGT GTTTTTGGGC GTCTACCCCA ACGATGCGGC TACCGTGGCC
GTGGCTCGCG CGCTCGAGGT GCGCCATGGA GGCACCGTGG TGTGCGTGGT GGGAGACGAT
CCCGGACCCA CGTCGAAGGC CGCCAACATC AACCATACCG TGCGGGCCAT CCGGGAATAC
GAGGCCGAGC GCGACGTGCG CTTCGCGAGC GTCACCATCC ATGACGCGGA GGACGTGGTG
CACCCCAACG AGTTCAAGAT GACGAACTAC TTGATCGACG ACTACGACGC CCTCCAGTTC
CCCGTGTTCC CGCTGCAGCG CATGCCGCGG CTGCGGCTGT TCTTCAAGAC GTTGACCTCG
TCCACCTACG CCGACGAGTT CGCCGAGCAT CACTTCCGCA CCATGGTCAT GCGCGACGAG
CTGGGCTTCG TTCCTTCGGC GGGCACGGGC TTCGCCATCG GCCGGCGCGT CCTCGACGCG
TTTCGCGACG AGGACCTGCT GCCGCGCAAC AGCCTGACCG AGGATTACAA GCTGTCGCTC
ACCTTGCGCA TGCGCGGCTT CCGCGTGCAC TACGTGCTTG AGAAGGTGCC GCGCGTCGAC
GCGCGCGGGC GCACCGTGTG GGACTACATC GCCACGCGCT CGCTGTTCCC GTCGACGTTC
AAGGCGGCGG TGCGGCAGAA AGCGCGATGG GTGTACGGCA TCACGATGCA GAGCGCGAGC
ATGGCCGATG TGTTCGGCAA AAGCGAGCTT ACGTTCGCCG AGCGCACGTT TCTGTACAAG
GGCCTCAAGG CGAAGTTCGC GAACTTCGTG CTGCTGCCGG GCTACGCCGT GCTCGCGTAC
TTCCTCGTGC AGACGTTCGC GCCGCAGCTG GAGCTGCCCG TCATGTACCC CTTGCACAGC
CCTTCGTGGT GGATGTGCGT GTTTCTGCTG TTCATGATGG TGGAACGGCA GGTGCTTCGC
GGACGCGCGC TGGCGAACGT GTACGGCTGG AAGACGATGG CGTTCTCCAT CCTGCTGCCG
CCGCTGTTCC CGCTCAGGCT TCTATGGGGC AACCTCATCA ACATGTGCGC GACGTTTCGC
GCATGGCGGC AGAAGATCGC CTACGTGCTG CTGCGCGGAC GAGAGGCCAA GGCGGCCGCC
GCGCCGGTCG TCGAGCATCG GGGCAACGCG GCAGAGGAGG AGGGCGAAAG AAAGCCTGCA
ACGGACGGAG ACGAGGCGCA AACCTCGAAT GCGACGTCTG CGCAGGAAGG TCCCGCATGG
AACAAGACCG ACCACGAATT TCTCCCCGCT TCGGTGCTCG AACGCTATCG GCGCCTTCTG
GGCGACGCGC TGCTGGAGAG GGGTTTCGTG GAGCCAGGGC ATCTGGAAGA CGCCGTGGGA
TCGGCGCGTG CGCGCGGCGT GCGGTTGGGG CAGGAGCTGC TGAGGCAGGG ATTGGTCGAA
GAGAGGCACC TCACGCAGGC TTACGCCTTG CAGCAGCAGT CGATGTACGT GCGTGCACAG
CCCGACCTCG TGCTTCTGGA GCTGATGGAT CGCATGCCGT TCGCCGCGGC GGATCGGTTC
GCCGCGCTGC CGCTGGTCGA GAGCGAAAAA GGATGGATCG TCGCCGTGGA CGACGACCTT
TCTTGCGCGG AGCGAGACGA ACTGGCGTTT CTGCTGGGCG AACCGACGTT CTTCCTGTTC
TCCAGCACAG CCGACCTGCT CGAAGCGTTC GAAGGCGCTC TCGCGTTCGA CAACGCGGCG
GAAGCTCCGC AACCTGCCGG GGCGGCGACG CTTCTGGAGG AGACGAGCGT AGAGCTGCCA
CAGGCGGGCA TGGCGCTAGC TTACGCGCTG CATCTCGGCC GCTCGGTCGA CGACATCGCT
TGCGAGATGG GCCTCGCCGT GTCGCGTTTC TCCTAG
 
Protein sequence
MDEQVVYWIG FFVALAFIVF GADDVLWDVF ALFRGTGKKR VKLSLINEKP PKMLAVVIAA 
WHEDAVLGEV VDNLVASAQY PRSLYRVFLG VYPNDAATVA VARALEVRHG GTVVCVVGDD
PGPTSKAANI NHTVRAIREY EAERDVRFAS VTIHDAEDVV HPNEFKMTNY LIDDYDALQF
PVFPLQRMPR LRLFFKTLTS STYADEFAEH HFRTMVMRDE LGFVPSAGTG FAIGRRVLDA
FRDEDLLPRN SLTEDYKLSL TLRMRGFRVH YVLEKVPRVD ARGRTVWDYI ATRSLFPSTF
KAAVRQKARW VYGITMQSAS MADVFGKSEL TFAERTFLYK GLKAKFANFV LLPGYAVLAY
FLVQTFAPQL ELPVMYPLHS PSWWMCVFLL FMMVERQVLR GRALANVYGW KTMAFSILLP
PLFPLRLLWG NLINMCATFR AWRQKIAYVL LRGREAKAAA APVVEHRGNA AEEEGERKPA
TDGDEAQTSN ATSAQEGPAW NKTDHEFLPA SVLERYRRLL GDALLERGFV EPGHLEDAVG
SARARGVRLG QELLRQGLVE ERHLTQAYAL QQQSMYVRAQ PDLVLLELMD RMPFAAADRF
AALPLVESEK GWIVAVDDDL SCAERDELAF LLGEPTFFLF SSTADLLEAF EGALAFDNAA
EAPQPAGAAT LLEETSVELP QAGMALAYAL HLGRSVDDIA CEMGLAVSRF S