Gene Elen_1056 details


Gene Information

Locus tag: Elen_1056
Symbol:
ID: 8415346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1278392
End bp: 1280227
Gene Length: 1836 bp
Protein Length: 611 aa
Translation table: 11
GC content: 60%
IMG OID: 645024019
Product: myosin-cross-reactive antigen
Protein accession: YP_003181416
Protein GI: 257790810
COG category: [S] Function unknown
COG ID: [COG4716] Myosin-crossreactive antigen
TIGRFAM ID:
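
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the reported numbers) and that the gene ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1278392
end_bp = 1280227
gene_length_bp = 1836
protein_length_aa = 611

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Each codon is three bases; the final stop codon encodes no residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.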


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.488016
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.00102136
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTACTATT CCAGCGGAAA CTACGAAGCA TTCGCCCGCC CGGCAAAGCC GGAGGGCATC 
GAGGACAAAT CGGCCTACAT CGTCGGCACC GGGCTGGCGG GCTTGTCGGC CGCGTGCTAC
CTGATCAGGG ATGCTCAGAT GGACGGCTCG CGCGTGCACC TGTTCGAACG CGACGCCGAA
CCGGGCGGCG CCTGCGACGG GTGGGAGTAT CCCCAGCTCG GCTTCACTAT GCGCGGCGGT
CGCGAGATGG ACAACCACTT CGAAGTGATG TGGGACCTGT TCCGATCCAT CCCCTCCATC
GAGGACGAGA ACATGAGCAT CCTCGATTAC TACTATCAAT TGAACAAGCG CGATCCCAAT
TATTCCCTGT GCCGCGCAAC GGTCAACCGG GGTGAAGACG CCGGCCTCGA CAATACATTC
AACCTGTCCG ACAAGGCGTG CATGGAGATC ATGAACCTGT TCTTCACCCC CGAAGAGCAA
CTCGACGACA AGCCCATAAC CGACTATTTT TCCGACGATG TGCTCGATTC CAACTTCTGG
CTGTACTGGC GCACCATGTT CGCCTTCGAG AGCTGGCACA GCGCGCTCGA GATGAAGCGC
TACATCCAGC GTTTCGTCCA CCACGTCGGC GGGCTCCCCG ACTTCAGCGC ACTGCGCTTC
ACCCGGTACA ACCAGTTCGA ATCCCTCATC CTGCCCATGG TGAACTACCT GAAAGGCCAG
GGCGTGCAGA TCCATCTGAA CACCGAGGTC GTCGACATCA CGTTCTCCAG CACCCCGGAG
CGCAAAGTAG CCACCGAGGT GCAGACGATA TGCGAAGGGG CCGACAAGAC GTTTCACCTC
ACCGACGACG ACCTGCTGTT CATCACCAAC GGCAGCTGCG TCGCCAATTC GTCCTTCGGA
TCGCAAGACG AGCCGGCCCA GTTCAACGCC GTGCTGGAAA AAGGAACCGG ATTCGACCTG
TGGCGACGTA TCGCACGGCA GGATTCCGCA TTCGGGAATC CGGAAAAATT CATCGGCGAT
CCGGAGAAGT CCAACTGGAT GAGCGCCACC GTCACGACGC TCGATGAGAA GATCGTTCCC
TATATCGAGA GCATTTGCCG TCGCGATCCG TTCTCGGGCG GCGTCGTAAC CGGCGGCATC
GTTACCGTAA AAGATTCGAA CTGGCTTTTA AGTTGGACGT TCAACAGGCA ACCCCAGTTC
AGAGCCCAGC CCGGCGATCA GCTGTGCGGC TGGATATACG GCCTGTTCAC CGATGTGCCC
GGCAACTACG TGAAGAAAAC CCTGCGCGAA TGCACCGGCA AGGAAATATG CATGGAGTGG
CTGTACCACC TGGGCGTTCC CGAATCGCAG ATCGAGGAGC TCGCCGAGAA CAGCGCCAAC
ACGGTTCCGT GCATGATGCC CTACATCACG GCATTCTTCA TGCCGCGCGC CGCAGGCGAC
CGACCCGACG TCGTGCCCGA AGGCGCGGTG AACTTCGCGT TCATCGGCCA ATTCGCGGAA
ACGCCGCGCG ACACCATTTT CACCACCGAG TATTCCATGC GCACCGGCAT GGAAGCCGTC
TACACGCTGT GCAACGTCGA CCGCGGCGTG CCCGAAGTAT GGGGCAGCGC GTTCGACATC
CGCGATTTGC TGAACGCCAC CACGTTGATC AGGGACGGCA AGCCCATCAC CGACATGGAG
ATGAATCCCC TCGAAAAGCT CGCCCTGCAC GAGGGAATCG AGAAGATCAA GGGGACCGAC
CTCTACGGTC TGCTGGTCGA GTTCGGCGTG ATTCCGCCCG ATCGCGGCGA CGCGCCCGCG
CCCGCCACCG GAGCCGTATA CCCCGGCATG CATTAG
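
The GC content field in the record can be recomputed from a sequence listed in this 10-base block layout. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that strips the spacing and line breaks before counting. It is shown here on the first 60-base row only; pasting the full gene sequence works the same way.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    # Remove the spaces and line breaks used for display formatting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence, copied from the listing above.
first_row = "GTGTACTATT CCAGCGGAAA CTACGAAGCA TTCGCCCGCC CGGCAAAGCC GGAGGGCATC"
print(f"{gc_content(first_row):.0%}")
```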
 
Protein sequence
MYYSSGNYEA FARPAKPEGI EDKSAYIVGT GLAGLSAACY LIRDAQMDGS RVHLFERDAE 
PGGACDGWEY PQLGFTMRGG REMDNHFEVM WDLFRSIPSI EDENMSILDY YYQLNKRDPN
YSLCRATVNR GEDAGLDNTF NLSDKACMEI MNLFFTPEEQ LDDKPITDYF SDDVLDSNFW
LYWRTMFAFE SWHSALEMKR YIQRFVHHVG GLPDFSALRF TRYNQFESLI LPMVNYLKGQ
GVQIHLNTEV VDITFSSTPE RKVATEVQTI CEGADKTFHL TDDDLLFITN GSCVANSSFG
SQDEPAQFNA VLEKGTGFDL WRRIARQDSA FGNPEKFIGD PEKSNWMSAT VTTLDEKIVP
YIESICRRDP FSGGVVTGGI VTVKDSNWLL SWTFNRQPQF RAQPGDQLCG WIYGLFTDVP
GNYVKKTLRE CTGKEICMEW LYHLGVPESQ IEELAENSAN TVPCMMPYIT AFFMPRAAGD
RPDVVPEGAV NFAFIGQFAE TPRDTIFTTE YSMRTGMEAV YTLCNVDRGV PEVWGSAFDI
RDLLNATTLI RDGKPITDME MNPLEKLALH EGIEKIKGTD LYGLLVEFGV IPPDRGDAPA
PATGAVYPGM H
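
The protein sequence corresponds to the gene sequence read with translation table 11. The sketch below illustrates this on the first 30 bases. It builds the codon table from the standard NCBI amino-acid ordering (table 11 assigns the same amino acids as the standard code and differs in its permitted start codons). `translate_cds` is a hypothetical helper that assumes the first codon is a valid initiator, here GTG, and reads it as methionine.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base, then second, then third, each over TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for pos in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is a valid start codon,
        # so it is read as methionine whatever it normally encodes.
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTACTATT CCAGCGGAAA CTACGAAGCA"))  # MYYSSGNYEA
```

The result matches the first ten residues of the protein sequence, and the final two codons of the gene (CAT TAG) give the terminal H followed by a stop.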