Gene Elen_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0939 
Symbol 
ID8415229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1143385 
End bp1144458 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content61% 
IMG OID645023903 
Productintegrase family protein 
Protein accessionYP_003181300 
Protein GI257790694 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0828563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTAA AGCTTGCCAA GAACGGCACC TGGCTCGCGC AATTCAGGTG CAAGGACAAG 
TTTGGCAACG AAGTGCACAA ATGCAAGCGC GGGTTCGCGA CGGCGGAAGA GGCGCAGGCT
TGGGAGGACG AGTTCATCGC CAGCGCGGGC TGCACCATGG AGATGACGTT CGGCGAGTTC
TTCAAGGTTT ACGAAGCCGA TCTTCGCCCC AGGCTGCGCG AGCACACCTG GCGGCAGAAG
GAGTACGCGA TCAAGTCGAG GGTGCTGCCC TTCTTCGCGA ACAAGAAGAT GGACGAGATA
CGCACCATTG ACATCGTCCG TTGGCAGAAT GCCCTCATGG CGCCCGACGC AAACGACGGA
AAGCCCTACA GCGCCACGTA TCTGCGCACG CTCAACAACC AGCTGACCGC CATTCTGAAC
CACGCGGAGA GGTACTACGG GCTGAGCCCC AACCCCGCGA TGAGAACCGT GAAGATGGGC
GGCAAGGAAG CCCGCACGAT GAATTTCTGG ACCAAGGACG AGTATCTGAG GTTCTCCGAC
GCAACGATGG ACGATCCCCG CGCGTTCGTG ATCTTCGAGG TGCTGTACTG GACCGGCGTC
CGCGAGGGAG AGCTCCTGGC CCTCACGCCG GACTCGTTCG ATTGGAAGAA ATCGACCATG
CGCATCGACA AGTCCTACCA GAGATTGGGC GGCCGCGACG TGATAACCGA CCCGAAGACG
CCCAAATCGG TGCGAACCGT GAAGATGTCG CGCTTCCTGG CCGACGAGGT GCGCGATTAC
GCGAACCACC ACCCGGAGAT TGGCGAAGGC GATCGCCTGT TCCCCGTGTC GAAGCACTAC
ATATCGCACG CGATGCAGCG AGGATGCGCG GCGAGCGGCG TGAAAAAGAT ACGCGTGCAC
GATTTGAGGC ACAGCCACGT GAGCCTGCTC ATCAACATGG GCTTCACCGC CCTCGCCATC
GCAGACCGCA TGGGGCACGA GGCGACCGAC ATCACCTTCC GCTACGCCCA CCTGTTCCCG
AATGTGCAAG ACGACATGGC CAACGAGCTC GAAGAAGAAC GGGGTGGATT CTGA
 
Protein sequence
MSVKLAKNGT WLAQFRCKDK FGNEVHKCKR GFATAEEAQA WEDEFIASAG CTMEMTFGEF 
FKVYEADLRP RLREHTWRQK EYAIKSRVLP FFANKKMDEI RTIDIVRWQN ALMAPDANDG
KPYSATYLRT LNNQLTAILN HAERYYGLSP NPAMRTVKMG GKEARTMNFW TKDEYLRFSD
ATMDDPRAFV IFEVLYWTGV REGELLALTP DSFDWKKSTM RIDKSYQRLG GRDVITDPKT
PKSVRTVKMS RFLADEVRDY ANHHPEIGEG DRLFPVSKHY ISHAMQRGCA ASGVKKIRVH
DLRHSHVSLL INMGFTALAI ADRMGHEATD ITFRYAHLFP NVQDDMANEL EEERGGF