Gene Elen_0839 details


Gene Information

Locus tag: Elen_0839
Symbol:
ID: 8415129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1035183
End bp: 1037243
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 65%
IMG OID: 645023805
Product: Relaxase/mobilization nuclease family protein
Protein accession: YP_003181202
Protein GI: 257790596
COG category:
COG ID:
TIGRFAM ID:
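
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1035183
end_bp = 1037243
gene_length_bp = 2061
protein_length_aa = 686

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp)        # 2061, matches Gene Length
print(span_bp % 3)    # 0, the span is a whole number of codons
print(coding_codons)  # 686, matches Protein Length
```

Under these assumptions the four fields agree with one another.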


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATGC TCAAGACGCT CGCCCGGAAG TTCGGCGGCT GCAAGCAGAT AAAGGACTAC 
CTCGCGCGCA ACGGCCGCGC CGTGACGTTC CGCTCAAGCT CAATCGATCT CGATAGCGCC
GGCTGGGACG GGCAGATGGA CGCGACGCGC CAGGTGTTCG GCAAGGACAC CGGGCGAAAG
TACTACCATT TCGTAATCTC ACCCGATCCC GAGGACGGCC TGGATGCCAA GGCGGTAGAC
GCATTGGCAT GCGATTGGGT GCGCGAGCGC TACAGCGGAT ACCAGTGGGT GATCGAGACT
CATATAGATA ACGGCATTCC GCATGCACAT ATCGTCGTCA ACTCCGTCAA CCCCGTTGAC
GGGAGGAAGA TACACCTCGA CGACGATGAC GTGCAGGCCG ACGCGATGGA GTTGCAGAGA
ATCTGCCGCG ACTACGGCTA CACCGCCTTC GACAACTTCA AGTTCACGAG GGACGAGGAC
GGTTCCTGGT ACGCACGCAC GCCGCGCCCC GACCGCCGGC GCGAGGTCGT CTTGCAGGAG
GCGCGCCCGG CGCGCCGTCA TGTGACGGAC GCCCAGCGCC GCGCCAGGTA CAGCGGCAAG
AAACTCTGGA CCGATGCGAT GCACGACGAT ATCGAGTCGG CTTTGCAGGG GTGCCGGACA
TGGCCCGCGC TCGAGAGGCG TTTGGCGGAG AAGGGCTACC GCATCAATAT CAACCGCCGC
GGAGTGCTGA CCTTCTACCC GCCCGAGGGC AAGGGCCATC CCGTGAAGGG TTACAAGCTG
GACGACTCCT ATACGGTCGA GGGATTGCGC GAGCGCCTTG CCGTGCGACT CGAGGGACAG
CGCCCGAACG TCAGGATGTG CACCGAGGAC CTTGTCCCGG ACGTCGTCCT GCCGCCCATG
ACTTTCGAGG CGGTCGCATC CAACAATCTC GAGCGGTCTA AGAGGATCGA GCGCTCCGCC
GCCCGCCTTG CCGCCGCGGC CGACGCCGTG AACATTATCC GCGAGCGCGG CTACCGCTGC
TATGCGGACA TGGCGGCCGA TGCCAGGCGC CTCGCCGCCG AGGTCGACGC GCTCGACGCG
AGGGTCGACG CGCTCAAGCT GGAGGCCGAG CAGACCGCCG AGGCCATCCG CCAGGTCGAG
GTCTACACGT CTTGCGTCGC GCAGTTGACC GCGCGCCCGG AAGGAGCCGA AGCGCTCGCG
GCATGGCGCG AGGACAATGC AGACGAGCTT CGCGAGATGG AATCGATCCG CAAGTGGATG
GCCGAGCGCG GAATCGTGCT CACCGATATG ACATACACGG ACCTCCTCGC ACGCCGCGAC
GGCGTCAGGG CCGACGTGGC GAACCTCGCT TCCATGGCAG TCGATATCCG CAACCGCTCG
CGCCGACTTT CCGCCGCTGT CGATGTCCTG GACTCATCGC AGATGCCGAT TGGCCCCACA
GATGTACGGG GCCGCGGCGA GAGGCAGAAG GGCGCCGGTC ACGGCCTGCG CGACGCCAAG
GTCATCACCG CGAACCAGAC CGCCGAGCTG ACGCGCGAGT ACACGCGCTA TCGCAAGGCC
TATTCGGGGG CCATCGCGGC GGGCGCCCTC ACGCCGGATA AGATCAAGCA GATTGAGCGC
GACCACGAGC GGGCCAAGGC GAAGATCATC GGCGAGCAGC GCGAGGAGGA CCTGCGCGCG
GAACGCTCGG CGTACGGGAT GGCCGGAAGC GAATGGGCGC AGGATGCTCC CGTGACGGTC
GCCGCACCCG GCGGGACGGA CAAGAGGTCG AAGCCCGGAT GGAAGCCGTA CCTCCAACGC
CCCGCGACCG AGAAGCAGAT GAAATATCTC GAAGACCTCG TAGAGTCCGG CATCATCGCG
CAGGCCGACA TCGACAGCCT CGGCGGCGAG CCGACCATCG CGGACGTCAA CGCGCTCCTG
AACTCGCATC CCAAGGTCAA GAACCTGGAG CTTGAGAAGA TGGACGGCGA CGGCGCGGAC
GGCACACGGA CGTACACGCA GACCCGAGAC GATATCGAGG ACGACGGCTA TACGCCGGCG
AACTACACGA GGAGGAGATA G
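
The gene sequence is printed in space-separated groups of 10 bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how a GC percentage such as the one in this record could be computed from that layout. The function name `gc_percent` is hypothetical, and the method the source database used to obtain its figure is not known.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring all whitespace."""
    # Join the 10-base groups (and any line breaks) into one string.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, kept in the page's 10-base grouping.
first_row = "ATGGCCATGC TCAAGACGCT CGCCCGGAAG TTCGGCGGCT GCAAGCAGAT AAAGGACTAC"
print(round(gc_percent(first_row), 1))
```

Passing the full sequence, with its line breaks, in place of `first_row` gives the value for the whole gene.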
 
Protein sequence
MAMLKTLARK FGGCKQIKDY LARNGRAVTF RSSSIDLDSA GWDGQMDATR QVFGKDTGRK 
YYHFVISPDP EDGLDAKAVD ALACDWVRER YSGYQWVIET HIDNGIPHAH IVVNSVNPVD
GRKIHLDDDD VQADAMELQR ICRDYGYTAF DNFKFTRDED GSWYARTPRP DRRREVVLQE
ARPARRHVTD AQRRARYSGK KLWTDAMHDD IESALQGCRT WPALERRLAE KGYRININRR
GVLTFYPPEG KGHPVKGYKL DDSYTVEGLR ERLAVRLEGQ RPNVRMCTED LVPDVVLPPM
TFEAVASNNL ERSKRIERSA ARLAAAADAV NIIRERGYRC YADMAADARR LAAEVDALDA
RVDALKLEAE QTAEAIRQVE VYTSCVAQLT ARPEGAEALA AWREDNADEL REMESIRKWM
AERGIVLTDM TYTDLLARRD GVRADVANLA SMAVDIRNRS RRLSAAVDVL DSSQMPIGPT
DVRGRGERQK GAGHGLRDAK VITANQTAEL TREYTRYRKA YSGAIAAGAL TPDKIKQIER
DHERAKAKII GEQREEDLRA ERSAYGMAGS EWAQDAPVTV AAPGGTDKRS KPGWKPYLQR
PATEKQMKYL EDLVESGIIA QADIDSLGGE PTIADVNALL NSHPKVKNLE LEKMDGDGAD
GTRTYTQTRD DIEDDGYTPA NYTRRR