Gene Elen_2638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2638 
Symbol 
ID8416963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3059303 
End bp3061495 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content64% 
IMG OID645025616 
Productphage minor structural protein 
Protein accessionYP_003182978 
Protein GI257792372 
COG category 
COG ID 
TIGRFAM ID[TIGR01665] phage minor structural protein, N-terminal region 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0220022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTAT TCGTCTGCGA CCGCTGGGAG AACTACAAGG GCGCGATCAA GACGCTCGCC 
TCGTGCATCG ACACCCGCGA GCTGAACGGA GAGAACAGCC TCGAGATCGT CTCGTTCGCG
GTGCTCGACA AGGGCGACCG CATCGTTTGG CGCGACCTCA AGGGCCGTTG GCGCGAGAAC
ATCGTCGGAA GCTGCGACGA GAGCCACGCC GACACGGGCG TGGAGCGCAC GCACTACTGC
CCGAATTCCG CCATCGAGCT GCGCGGCGAC TACATCGAGG ACAAGCGCCC CGGCAACTGC
TCGGCGGCGA CCGCGCTCGC CTCGGCGCTA TCGTCGTCGC GGTGGACGGT CGGCCAGGTC
GATGACCTCG GCACGGCACA GGTCAACTTC TACCACGCGA GCGCGTGGCA GGCGATACAC
GACGTGGCCG ACGCCTTCGG CGGCGAGCTG CGCTTCGATA TCGCGGTTTC CGGCTCGCGC
GTCGTATCGC GCCGCGTGAG CCTGCTCGCG CACGTCGGGG CCGACACGGG CAAGCGGTTC
ACGTACCGGA AGGACCTATC GGAGTGCAGG CGCACCGTGT CGGAGGACGA CGTGTGCACG
GCGCTCTACG GCTGGGGCGC GTCCCTTGAG ACGACCGACG ATGACGGCAA CCTCACGGGC
GGGTACTCCC GCAAGCTATC GTTCGCGGAC GCGAACGGCG GGGTCAAGTG GATCGGAGAC
GACGCGGCCC GCGAGAGGTG GGGCCGCCCC GACGGCAAGG GAGGCAAGGC CCATGTCTTC
GGCGAGGCGA CGTTCGACAA GTGCGAGGAC CCGGCGGAGC TGCTGCGCCT CACGAGGGCG
GAGCTTGCCA TGCGCAGCGT GCCCAAGGTG AGCTATGCGG CGTCCGTCGC CGCGACCCGG
AACGCAGGCG TCGGGTTCGA GGGGTCAGAC GAGGGCGATG CCGTGGCCGT CATCGACGGC
GACCTCGGCG TGCGCGTGAT GGCCCGCATC ACGAAGATCA AGGAAGACCA GCTCGAAGAG
GAGAACACGA CCTACGAGTT CGGAAACTTC GGCGACCTCG CGGACGTGTT CAAGGCCCAG
AAGAGCGCGA TCCGCGAATC ATCGGATTCG GTGGCCTCCT ACGCGCAGAA CGCCGTCAAC
GCCTCGAACG CGCAGACGAA CAGGAACATG GGCGCGCTCG GCGACAGGTT CGAGCAGGAG
GTGAACGAGG CGATCAAGCA CGGCGACGAC CAGGTGGCCG CGCTCAAGGC CGAAATGGAC
AAGATACCCT CCGATATCCG TGAACAGCTC ATAGACATGA TCAACGAGGA GGTTAACACG
ACGGGCGGCT GGGTGTACGA GGAACCCGGC AAGGGCATAT TCGTGTACAA CTCGCGCCCG
GAGAGCGCGA CGAAGTGCGT GAAGATCGGC GGCGGCATCG TGGCCGTGGC GAACAGCAAG
TACAGCAGCG GCAATTGGTA CTTCAAAACC GCGATGACGG GCGACGGCAT CGTGGCCGAC
CGCATCTATA CCGGCCGCAT AACCGGCGGC AGCTCGTACT TCGACCTCGA CAGCGGCACT
ATCAACATGC GCAACGGCAT CATCAACATC ACCGACACCA ACGGGAACAC CGTAACGGTG
TCGCCGTCGC TCGGGTTCCA AGTGCGCGAC AAGAACGGCT CCATGATCGC CGGAACGGTG
ATAGCGAACG GCAAGGCGTT CTTCATGAGC CGCGCGGTTG GCGTGTCGTC GTCGCTCTAC
GTGACGACAG GGACGACAGC GGCGGGGAAT CCAGGCGCTT CGTTCGTCAA CTCGCAGGGC
AACTACCTCG ACGTCGAGGC GCTTCGCGCA TCGGCCGACC CGAGCAACAG GACGACGGGT
GCGGGCATGG CCTGCTTCAA CAAGCCATTC CTGCACACGA GCACGTACTA CAAGCAGCTA
TGGCTCCATC CGCCGTATTT CGACGACACG TATATGCAGC AGCCGAACGA GCAGCTATTC
CTTCGCTGCG GAGGCAACAG CCTCGGCGAA TCGCCGCTCG TGAAGCTGCA ATACAGCAGT
TCGCAAGGGC TGTTCTTCGG GAGCGGGTAC GTAACGCTCA AGCTCGACGA TTCGCACTAT
GTTCGCATCT CGTCGGACGG CGTGCAATGC CGCTGCGGCT CGAAGGGCTT CGGATGGCTC
AATGGCGAGT TCCACCAAAC GCTCGTCTGG TAA
 
Protein sequence
MQLFVCDRWE NYKGAIKTLA SCIDTRELNG ENSLEIVSFA VLDKGDRIVW RDLKGRWREN 
IVGSCDESHA DTGVERTHYC PNSAIELRGD YIEDKRPGNC SAATALASAL SSSRWTVGQV
DDLGTAQVNF YHASAWQAIH DVADAFGGEL RFDIAVSGSR VVSRRVSLLA HVGADTGKRF
TYRKDLSECR RTVSEDDVCT ALYGWGASLE TTDDDGNLTG GYSRKLSFAD ANGGVKWIGD
DAARERWGRP DGKGGKAHVF GEATFDKCED PAELLRLTRA ELAMRSVPKV SYAASVAATR
NAGVGFEGSD EGDAVAVIDG DLGVRVMARI TKIKEDQLEE ENTTYEFGNF GDLADVFKAQ
KSAIRESSDS VASYAQNAVN ASNAQTNRNM GALGDRFEQE VNEAIKHGDD QVAALKAEMD
KIPSDIREQL IDMINEEVNT TGGWVYEEPG KGIFVYNSRP ESATKCVKIG GGIVAVANSK
YSSGNWYFKT AMTGDGIVAD RIYTGRITGG SSYFDLDSGT INMRNGIINI TDTNGNTVTV
SPSLGFQVRD KNGSMIAGTV IANGKAFFMS RAVGVSSSLY VTTGTTAAGN PGASFVNSQG
NYLDVEALRA SADPSNRTTG AGMACFNKPF LHTSTYYKQL WLHPPYFDDT YMQQPNEQLF
LRCGGNSLGE SPLVKLQYSS SQGLFFGSGY VTLKLDDSHY VRISSDGVQC RCGSKGFGWL
NGEFHQTLVW