Gene Elen_0731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0731 
Symbol 
ID8415021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp920276 
End bp921997 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content66% 
IMG OID645023702 
Producthypothetical protein 
Protein accessionYP_003181099 
Protein GI257790493 
COG category[S] Function unknown 
COG ID[COG0392] Predicted integral membrane protein 
TIGRFAM ID[TIGR00374] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGAAAG CGCTGCTGCT CGTCATCGGC ATCGTCGCAG TGTGCTTTCT CATCGCGAAC 
GCCGATTACC TGGCGAGCTT CCTCGCGACG CTCAAGACCG GCGCGCTCGT GCCGCTGGTC
GTGGCTTGCG TGCTCATGCT GGCGCGCCAC CTCGTGCAGG CGGCGTCCTA CGACGCGGCC
TTCGAGGCGG TCGGCCATAA GACGGGCTTC TGGCACAACG TCGTGCTCAT CTTCTCGCTG
GTGTTCATCA ACACGTTCTG CCTGTTCTCG GGGGCGACGG GCGTGGCGTT CATCATCGAC
GACGCGCACC GCACGGGCGC GGACGCGGGC ACGTCCACCA GCGGCGCTAT CCTCTCGCAG
ATCGGCTACT TCGCCGCCAT CCTCGTGATC TCCGTCATCG GCTTTCTCAC CATGCTGCTG
TCCGGCAGCA TGAACACGCT GTTCCTCGTC GGGGGGCTGG CGTTGGCTGC GGTGCTGGCG
GCGCTGTCCA GCATGTTCGT GGTGGGCTAC CGCAAACCGC GCGTGCTGTT CCGCCTGTTC
ATCGGCATCG AGTCGCTCAT CAACAAGGCG CTGGGGCTGC TGAAGAAGCA TCTCAAGCCG
GCCTGGGGGC GCAAGATGGC CAGCTCCTTC ATCTCGTCGG CGGGCATCCT CGCGAAGAAC
CCGCAGGGCA CCATGGTCAC CGTGTCCTAC GCGTCGTTCT CGGCCATCCT CAACATGGCG
TGCCTCGTGG CCATCGGCTA CGCGTTCGGC TTCGAGAACG TAGCCGCGCT CGTGGCGGCG
TTCGCGGTGG CGGCCATCTC GGTCATCCTG AGCCCCACGC CTCAGGGCGT GGGCGTGGTG
GAGGCGGCCA TCGCCGCCAT CCTCACGGCG CACGGATGCT CGCTGGCCAC AGCCACGGCC
ATCGCGCTGG TGTACCGCGG TATCATGTTC TGGATCCCTT TCTGCATCGG CGCGCTGCTG
CTGTCGCAGT CGGGGTTCTT CGCCGACAAG AAGAGCCCTA CCGAGGAGAA GCGCGCGAAG
GACACGGCTT GGGTTTCGGG CACCATCGTG CTCATCGTGG GACTCGTGAA CATCGGCATG
GCGTTGATTC CGCAGACGTT CAGGCCGTTC ACCGCGCTCA CGGATTGGAT CAACATGGGC
GGCCTGCTCA TCGGTCCGTT CCTTATCGTG GGGAGCATCG TGCTCGTGGT GCTGGCCGTG
GGACTCATCC TGCGCTTCCG TACGGCGTGG GCGCTCACGT TGGGCGTGCT CGTGCTGGTG
GCTGGCGCGG AGTTCCTCTA CGTGAACACC GTGCAGGTTG CCGTGGCGGC TCTGCTGCTG
GTGATGTGGC TGTTCTGGAA GCGCGACGCG TTCGACCGTC CTATCGCTCC GCAAGACGAC
GCGCCGCGCC TCGTGCGCGA GTTCCGCGAG AACGTCGAGC GGTTCCGCGC TTGGCGCGCC
AGGCGGGCTG CGGCGAAGGC AGCGGGTGAG CAGCCGCTCG CCGGCATCGG CAGCGCCATA
GCCTCGCGCC GAGAAGAGGG CGGCGCGCGC TCACCTGCGA AAAGGAAAAC GGGTTGGGAA
CAACGTGCCG AAAAAGGCGC GGAGATCATC CGCGAGGCTG GACATGAGGG TATACTGCCC
TTTGGCGATG ATGCCGAGCC AGCGCCCGCG GGGGCGGCAA CGGCAGGCGT CGCCGATTCT
GCAGCAATGA AGGAGGAGTC AAACCATGAT CGTGCTCGAT GA
 
Protein sequence
MKKALLLVIG IVAVCFLIAN ADYLASFLAT LKTGALVPLV VACVLMLARH LVQAASYDAA 
FEAVGHKTGF WHNVVLIFSL VFINTFCLFS GATGVAFIID DAHRTGADAG TSTSGAILSQ
IGYFAAILVI SVIGFLTMLL SGSMNTLFLV GGLALAAVLA ALSSMFVVGY RKPRVLFRLF
IGIESLINKA LGLLKKHLKP AWGRKMASSF ISSAGILAKN PQGTMVTVSY ASFSAILNMA
CLVAIGYAFG FENVAALVAA FAVAAISVIL SPTPQGVGVV EAAIAAILTA HGCSLATATA
IALVYRGIMF WIPFCIGALL LSQSGFFADK KSPTEEKRAK DTAWVSGTIV LIVGLVNIGM
ALIPQTFRPF TALTDWINMG GLLIGPFLIV GSIVLVVLAV GLILRFRTAW ALTLGVLVLV
AGAEFLYVNT VQVAVAALLL VMWLFWKRDA FDRPIAPQDD APRLVREFRE NVERFRAWRA
RRAAAKAAGE QPLAGIGSAI ASRREEGGAR SPAKRKTGWE QRAEKGAEII REAGHEGILP
FGDDAEPAPA GAATAGVADS AAMKEESNHD RAR