Gene Elen_1696 details


Gene Information

Locus tag: Elen_1696
Symbol:
ID: 8415995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2000919
End bp: 2002079
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 61%
IMG OID: 645024663
Product: protein of unknown function DUF1113
Protein accession: YP_003182051
Protein GI: 257791445
COG category: [S] Function unknown
COG ID: [COG4905] Predicted membrane protein
TIGRFAM ID:
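
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not the database's own method: it assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names gene_length and protein_length are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2000919, 2002079)
print(length, protein_length(length))  # prints "1161 386"
```

Under those assumptions the result matches the Gene Length (1161 bp) and Protein Length (386 aa) fields.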


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.176656
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.89414
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACA CGGTGAACGA GGAAGGTCGC CTCCCGGTGC CGCTCAAGGT ATTCGGCGTT 
CTCAGCATAG TAGGAGGACT CGCTTCCCTC GGCGACCTGG CGCCGGTGAT CTTCTCCTTC
GTGCAAGGCG TCGGCGGCGG CACGTACCGA ACGGCGTCCA TCGTCATCTT CGCCGGTCTC
ATGGCGGTGC TCACGTCATC GGCCGTGCTG TTCGTGTTCT TGGGCGTGCG GCTGCTGCGC
AATCGCCGAG CCCACGCGGC GCAGGCGGCG AACACCCTCG CGGCGCTCAC CGTGCTGGCC
GTCATCGGCA CGCTGATGCT GTTCGGCTTG TCTGCGCACC ACATCGGCTA CCTCGTTCAG
ATCGTCATTC TCGTCGCGGT GACCAGCTAT CTCGACCCGC AGTTGCACGA GGAGCGCCGG
CTACAGCGCA AGCTCAAGAA GATGGACAAG GAGCAGCGCT CGGAAGAGAG GAGGTTGGCG
CGCGAGCGCA AGCCGAAAAA AGGGTTCATC ACGCTGAACT TCTTCAACCT GTTCTGGATC
TTCGTGGTGT GCTGCGTGCT GGGGCTCATC ATCGAGGTGC TGTTCCATTT CGCGCTATAC
CATGAGTATC AGGATCGCGC TGGGCTTCTG TTCGGGCCGT TCTCTCCCAT CTACGGTTTC
GGCGCTCTGC TCATGACCAT CGCGCTCAAC CGTTTCCACG ACAAGCCCGT GTGGGTGGTC
TTCCTCGTGA GCGCCGTCAT CGGCGGTGCG TTCGAGTATT TCACGAGCTG GATCATGGAG
TTCTCGTTCG GCATCCGCGC ATGGGACTAT TCGGGCACGT TCTTGTCCAT CGACGGACGC
ACGAACTTCG TGTTCATGGT GATGTGGGGC GTGCTGGGCG TGGCGTGGAT CAAGCTTCTG
CTGCCGCGGC TTTTGAAGTT GATCAACCTC ATTCCGTGGA ACTGGCGCTA CGCGGTGACG
GCCGTATGCG CGGGGCTCAT GCTGGTGGAC GGCGTCATGA CCGTGCAGTC CATCGACTGC
TGGTACGCGC GATCGGCGGG CAAGGCGCCC GACACGCCCA TCGAGGAGTT TTACGCGAAG
CATTTCGACA ACGCCTACAT GGAGCACCGC TTCCAGACCA TGACCATGGA TGTGAACGAC
GCCGCTCGAG CCGATCGGTA A
 
Protein sequence
MNDTVNEEGR LPVPLKVFGV LSIVGGLASL GDLAPVIFSF VQGVGGGTYR TASIVIFAGL 
MAVLTSSAVL FVFLGVRLLR NRRAHAAQAA NTLAALTVLA VIGTLMLFGL SAHHIGYLVQ
IVILVAVTSY LDPQLHEERR LQRKLKKMDK EQRSEERRLA RERKPKKGFI TLNFFNLFWI
FVVCCVLGLI IEVLFHFALY HEYQDRAGLL FGPFSPIYGF GALLMTIALN RFHDKPVWVV
FLVSAVIGGA FEYFTSWIME FSFGIRAWDY SGTFLSIDGR TNFVFMVMWG VLGVAWIKLL
LPRLLKLINL IPWNWRYAVT AVCAGLMLVD GVMTVQSIDC WYARSAGKAP DTPIEEFYAK
HFDNAYMEHR FQTMTMDVND AARADR
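
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. The sketch below is an illustration only: the helper names clean, translate, and gc_fraction are hypothetical, and the codon table is written out by hand rather than taken from a library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


first_block = clean("ATGAACGACA CGGTGAACGA GGAAGGTCGC")
print(translate(first_block))  # prints "MNDTVNEEGR"
```

Applied to the first three ten-base groups of the gene sequence, the translation reproduces the first ten residues of the protein sequence. Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field.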