Gene Elen_0480 details


Gene Information

Locus tag: Elen_0480
Symbol:
ID: 8414764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 612262
End bp: 613482
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 645023451
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_003180854
Protein GI: 257790248
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
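
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the record states the gene is not spliced). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(612262, 613482)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.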


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTGCA TCGGGATAGA CATCGCCAAG AGAAGCCACG TCGCAAGCGC CGTCGACGAG 
GACGGGGGCA CCGTCGTCGA GTCGTTCAAG TTCAACAACA CGGCGAGCGG CTTCGCGAAG
ATGCTGAAGA GGCTTCGCAA GGCTGACGCG ACCCCGGAGA ACAGCTTGGT CGGAATGGAA
GCCACGGGGC GCTACTGGGT GGCCCTGTTC GACTTCCTCT CCTCCCACGG CTACGAGGCC
GCGGTGATCA ACCCCGTCCA GACCGACGCC TTCCGCGACG TCTGGACGGT CCGCAAGGTC
AAGACCGACG CCCTTGACGC CGCGATGATC GCCGACCTCG TCCGCTACAA GCGCTACTCG
CCCTCGGCGC TCGGCGACGA GGCGACCGAG GAGCTGCGCA ACCTCGCCCG CTACAGGATG
TCGCTCGTGG AGCGCTCCAC AGCCTTGAAG AACAGGGCGA CGGCCATCCT CGATAGGACG
TTCCCCGAGC TTGCAGGCCT GTTCAGCAAC GCGTACTGTC CGACCCGCCG CGAGCTGCTG
CAGCACTGCG CCACGCCCGA CCAGGTGCTC GGAACCGACG TCGGGACGCT CGAGCGCATC
CTCAGGGAGG CTTCGCGCGG GAGGTGCGGG CGAGCCAAGG CCGAGAGGCT CAAAGCGCTT
GCCAAGGAAT CCGTGGGCGT CGGGTTCGGC TCCAAGACGC TCGCCTTCGA GGCGAGGCTT
TTGATGGAGG AGCTCGACTT CGTGCAGGAC CAGGTCAAGG AGGTCGAAAG GGAGCTGGCC
CGCCTGCTGG AGCGGACGCA GGGCAAGTGG CTCGTCACCA TCCCCGGCAT AGACGTCGCC
CTCGCCTCCG TCATCGCGGG CGAGATCGGC GACCCGAACG CCTTCGACGA CCCCCACAGG
CTCATGGGTT ATGCCGGCAT GGACCCCACG AGAAGCGAGT CCGGCGAGAC CGTGAGGTCC
GACGGGCGCA TGTCCAAGCG AGGGCCCGGG GCGCTGCGAT GGGCGTTCAT GCAGGCCGCC
GACTGCGCCA GGAAGAACGA CCCCTACTTC GGAGACTATT ACGACAGGAA GAGGGGCGTG
GACGGGAAGC ACCACTACGT CGCCCTCTCG GGCGTCGCCC GCAAGCTCAT GGGCGTGGCG
CTGGCGGTGA TGAAGGAAGG GCGGCCCTAC GAGCCGGCCC CTCCCCGCAA CCACCGCCCC
GGCCACCTGA AGGAGGCGTG A
 
Protein sequence
MICIGIDIAK RSHVASAVDE DGGTVVESFK FNNTASGFAK MLKRLRKADA TPENSLVGME 
ATGRYWVALF DFLSSHGYEA AVINPVQTDA FRDVWTVRKV KTDALDAAMI ADLVRYKRYS
PSALGDEATE ELRNLARYRM SLVERSTALK NRATAILDRT FPELAGLFSN AYCPTRRELL
QHCATPDQVL GTDVGTLERI LREASRGRCG RAKAERLKAL AKESVGVGFG SKTLAFEARL
LMEELDFVQD QVKEVERELA RLLERTQGKW LVTIPGIDVA LASVIAGEIG DPNAFDDPHR
LMGYAGMDPT RSESGETVRS DGRMSKRGPG ALRWAFMQAA DCARKNDPYF GDYYDRKRGV
DGKHHYVALS GVARKLMGVA LAVMKEGRPY EPAPPRNHRP GHLKEA
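
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example under stated assumptions: the codon table is built from the standard NCBI amino-acid ordering over the bases T, C, A, G, which translation table 11 shares with the standard code for elongation codons (the two differ in the set of permitted start codons; this gene starts with ATG, so that difference does not matter here). The names `translate`, `gc_fraction`, and `clean` are hypothetical helpers, and only the first and last lines of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI ordering.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = "ATGATCTGCA TCGGGATAGA CATCGCCAAG AGAAGCCACG TCGCAAGCGC CGTCGACGAG"
last_line = "GGCCACCTGA AGGAGGCGTG A"

print(translate(first_line))  # MICIGIDIAK RSHVASAVDE without the spacing
print(translate(last_line))   # GHLKEA
```

The last line of the gene sequence is in frame because the 20 lines before it hold 1200 bases, a multiple of 3. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.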