Gene Elen_0729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0729 
Symbol 
ID8415019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp917566 
End bp919200 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content73% 
IMG OID645023700 
ProductIntegrase catalytic region 
Protein accessionYP_003181097 
Protein GI257790491 
COG category[L] Replication, recombination and repair 
COG ID[COG2801] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTACG GCAAAGAATA CAAGGAGGAG GTCCTGAGCA GGTTCCACGC GAGCGGCATG 
TCGATGCGCG CGGCGTGCTC GAGCCTCGAG GGGTTCCCGT GCGCGGCGAC GCTCTCGGCC
TTCGCGCGCG AGGAGGGGGC CGGCCTCCTG CGCCCGCCCG CGCTCGCGGT GCCGGGCAGG
TGCGAGGGCC GCCGGGCATG GGAGCCCTAC CCCCTCGGGA CGAAGCGGGA GGCGATGCGG
CTCCTCGCGG GCGGCATGGA GCCGCGCTTC GTCGCCGGGC GCCTCGGGAT CGCGAGCGCC
GCCCCCGTCC GCCTGTGGGC CTCCAGGCTG GGGCGCCTCG ACGGGCTCGA CGGCCGGTCC
CGTCCCGACG GCGAGCGCGA GCGCGGGGAA GCTGTTACGA TGTTCGCGCG AGGGTCGTCC
GTCTCCGAGA TAGCCGGACG GATCGGCGCG GACAGGAGGA CCGTGCGCCG CTGGCTGGAC
AAGGCCGGCG TCGAGCGGAA GCGCGCGACG AAGGGGAAGG GCGGCGAGGG CGTGGCGAAG
GAAGAAGGCG GCGAGCGGGG CGAATGGTCC CGCGCATGGG GCGACCTCCC CGAAGGCGAC
CCCGTCGAGC GGGCGCGGCT GGCCGAGGTC AGGCTCGCGG AGGCGCTGGC GGTGTTGGAC
GTCCTAAAAG CACCAGGCCC GGGCTCTTTG AGCAATTCGG AGAAGCGCCG GGCGGGCGAG
AGGGCGAGGG CGATGGCGGC GAGGGCGAGG GTCGATGACG TCCTGAGGGA TTTCCGCATC
GCCAGGAGCA CGTACTTCTC GCAGGCGGCG ATGGCGGCCA GGCCCGACAG GCACGCGGCC
CTGCGGGCGC GCGTGCGCGC GGCCTTCGAG GGCTCGAAGG GCCGCTACGG GTCGCTGAGC
GTGTGGGCGG CCCTGCGGCG GGGCGAGGGC GCGCCCGTGC GCGCCCGCGA CCTCGCGCCC
GGGGACATGG AGGCCCCCGT CGTCGTCTCC GAGAAGGTCG TGCGCCGGAT CATGCGCGAG
GAGGGGCTCG TCCCGGTCCA GGTCAAGGAG CGCCGGCGCC ACAGCTCCTA CGCGGGCGAG
ACCGACGAGC GCCCCGCGAA CCTGCCGCTT CGAGAGGACG GGACGCACGG CTTCCGCGCC
GACGCGCCGG GCAGGCTCGT CGTGACCGAC GTGACCGAGT TCGACCTCGG CGGCCTCAAG
GTCTACCTCT CCCCGATCAT AGACTGCTTC GACGGCTGCC CGGTGGCGTG GCGGACGTCG
ACGCGCCCGG ACGACGAGCT GACGGCGGGC TCGCTGGAGG ACGCGCTCGG GCGCCTGGAG
GAGGGCTGCG CCGTCCACAC CGACGGCGGC GGCAACTACC GCTCCGCCAG ATGGAAGGGC
GTCTGCGAGG CCAACGGCCT CGTCAGGTCG ATGTCGCGCA AGGCCAAGAG CCCCGACAAC
GCGAGGGCGG AGGGCTTCTT CGGGACGCTC AAGCAGGAGT TCTTCTACGC GAGGGACTGG
AAGGGGACGA CGAAGGGGAG CTTCGTGCGG GCCCTCGACG AGTACATCGT GTGGTATCGT
GACGAGAAGA TCAAGAGATC GCTCGGATGG AAGACGATAG CGGCCCATAG GGCGGCGCTC
GCCGCAGCCG CGTAG
 
Protein sequence
MAYGKEYKEE VLSRFHASGM SMRAACSSLE GFPCAATLSA FAREEGAGLL RPPALAVPGR 
CEGRRAWEPY PLGTKREAMR LLAGGMEPRF VAGRLGIASA APVRLWASRL GRLDGLDGRS
RPDGERERGE AVTMFARGSS VSEIAGRIGA DRRTVRRWLD KAGVERKRAT KGKGGEGVAK
EEGGERGEWS RAWGDLPEGD PVERARLAEV RLAEALAVLD VLKAPGPGSL SNSEKRRAGE
RARAMAARAR VDDVLRDFRI ARSTYFSQAA MAARPDRHAA LRARVRAAFE GSKGRYGSLS
VWAALRRGEG APVRARDLAP GDMEAPVVVS EKVVRRIMRE EGLVPVQVKE RRRHSSYAGE
TDERPANLPL REDGTHGFRA DAPGRLVVTD VTEFDLGGLK VYLSPIIDCF DGCPVAWRTS
TRPDDELTAG SLEDALGRLE EGCAVHTDGG GNYRSARWKG VCEANGLVRS MSRKAKSPDN
ARAEGFFGTL KQEFFYARDW KGTTKGSFVR ALDEYIVWYR DEKIKRSLGW KTIAAHRAAL
AAAA