Gene Elen_1071 details

Gene Information

Locus tag: Elen_1071
Symbol:
ID: 8415361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1293917
End bp: 1295551
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 73%
IMG OID: 645024034
Product: Integrase catalytic region
Protein accession: YP_003181431
Protein GI: 257790825
COG category: [L] Replication, recombination and repair
COG ID: [COG2801] Transposase and inactivated derivatives
TIGRFAM ID:
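
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon (the gene sequence in this record ends in TAG). The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1293917
END_BP = 1295551
GENE_LENGTH_BP = 1635
PROTEIN_LENGTH_AA = 544


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should agree with the lengths listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

If either comparison were to fail for another record, the likely causes would be a different coordinate convention or a partial or pseudogene CDS.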


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.296403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000028101
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTTACG GCAAAGAATA CAAGGAGGAG GTCCTGAGCA GGTTCCACGC GAGCGGCATG 
TCGATGCGCG CGGCGTGCTC GAGCCTCGAG GGGTTCCCGT GCGCGGCGAC GCTCTCGGCC
TTCGCGCGCG AGGAGGGGGC CGGCCTCCTG CGCCCGCCCG CGCTCGCGGT GCCGGGCAGG
TGCGAGGGCC GCCGGGCATG GGAGCCCTAC CCCCTCGGGA CGAAGCGGGA GGCGATGCGG
CTCCTCGCGG GCGGCATGGA GCCGCGCTTC GTCGCCGGGC GCCTCGGGAT CGCGAGCGCC
GCCCCCGTCC GCCTGTGGGC CTCCAGGCTG GGGCGCCTCG ACGGGCTCGA CGGCCGGTCC
CGTCCCGACG GCGAGCGCGA GCGCGGGGAA GCTGTTACGA TGTTCGCGCG AGGGTCGTCC
GTCTCCGAGA TAGCCGGACG GATCGGCGCG GACAGGAGGA CCGTGCGCCG CTGGCTGGAC
AAGGCCGGCG TCGAGCGGAA GCGCGCGACG AAGGGGAAGG GCGGCGAGGG CGTGGCGAAG
GAAGAAGGCG GCGAGCGGGG CGAATGGTCC CGCGCATGGG GCGACCTCCC CGAAGGCGAC
CCCGTCGAGC GGGCGCGGCT GGCCGAGGTC AGGCTCGCGG AGGCGCTGGC GGTGTTGGAC
GTCCTAAAAG CACCAGGCCC GGGCTCTTTG AGCAATTCGG AGAAGCGCCG GGCGGGCGAG
AGGGCGAGGG CGATGGCGGC GAGGGCGAGG GTCGATGACG TCCTGAGGGA TTTCCGCATC
GCCAGGAGCA CGTACTTCTC GCAGGCGGCG ATGGCGGCCA GGCCCGACAG GCACGCGGCC
CTGCGGGCGC GCGTGCGCGC GGCCTTCGAG GGCTCGAAGG GCCGCTACGG GTCGCTGAGC
GTGTGGGCGG CCCTGCGGCG GGGCGAGGGC GCGCCCGTGC GCGCCCGCGA CCTCGCGCCC
GGGGACATGG AGGCCCCCGT CGTCGTCTCC GAGAAGGTCG TGCGCCGGAT CATGCGCGAG
GAGGGGCTCG TCCCGGTCCA GGTCAAGGAG CGCCGGCGCC ACAGCTCCTA CGCGGGCGAG
ACCGACGAGC GCCCCGCGAA CCTGCCGCTT CGAGAGGACG GGACGCACGG CTTCCGCGCC
GACGCGCCGG GCAGGCTCGT CGTGACCGAC GTGACCGAGT TCGACCTCGG CGGCCTCAAG
GTCTACCTCT CCCCGATCAT AGACTGCTTC GACGGCTGCC CGGTGGCGTG GCGGACGTCG
ACGCGCCCGG ACGACGAGCT GACGGCGGGC TCGCTGGAGG ACGCGCTCGG GCGCCTGGAG
GAGGGCTGCG CCGTCCACAC CGACGGCGGC GGCAACTACC GCTCCGCCAG ATGGAAGGGC
GTCTGCGAGG CCAACGGCCT CGTCAGGTCG ATGTCGCGCA AGGCCAAGAG CCCCGACAAC
GCGAGGGCGG AGGGCTTCTT CGGGACGCTC AAGCAGGAGT TCTTCTACGC GAGGGACTGG
AAGGGGACGA CGAAGGGGAG CTTCGTGCGG GCCCTCGACG AGTACATCGT GTGGTATCGT
GACGAGAAGA TCGAGAGATC GCTCGGATGG AAGACGATAG CGGCCCATAG GGCGGCGCTC
GCCGCAGCCG CGTAG
 
Protein sequence
MAYGKEYKEE VLSRFHASGM SMRAACSSLE GFPCAATLSA FAREEGAGLL RPPALAVPGR 
CEGRRAWEPY PLGTKREAMR LLAGGMEPRF VAGRLGIASA APVRLWASRL GRLDGLDGRS
RPDGERERGE AVTMFARGSS VSEIAGRIGA DRRTVRRWLD KAGVERKRAT KGKGGEGVAK
EEGGERGEWS RAWGDLPEGD PVERARLAEV RLAEALAVLD VLKAPGPGSL SNSEKRRAGE
RARAMAARAR VDDVLRDFRI ARSTYFSQAA MAARPDRHAA LRARVRAAFE GSKGRYGSLS
VWAALRRGEG APVRARDLAP GDMEAPVVVS EKVVRRIMRE EGLVPVQVKE RRRHSSYAGE
TDERPANLPL REDGTHGFRA DAPGRLVVTD VTEFDLGGLK VYLSPIIDCF DGCPVAWRTS
TRPDDELTAG SLEDALGRLE EGCAVHTDGG GNYRSARWKG VCEANGLVRS MSRKAKSPDN
ARAEGFFGTL KQEFFYARDW KGTTKGSFVR ALDEYIVWYR DEKIERSLGW KTIAAHRAAL
AAAA