Gene Elen_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0003 
Symbol 
ID8414278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3231 
End bp4331 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content63% 
IMG OID645022975 
ProductDNA polymerase III, beta subunit 
Protein accessionYP_003180387 
Protein GI257789781 
COG category[L] Replication, recombination and repair 
COG ID[COG0592] DNA polymerase sliding clamp subunit (PCNA homolog) 
TIGRFAM ID[TIGR00663] DNA polymerase III, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000553224 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000000389013 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAAATTCA GCATCAACCA ATCGGAATTG CAGAACGCGT TGTCCGTGGT GCTCAAGGGC 
ATCGCAACCC GATCGACGCT TCCCATTCTG TCGGGCATCT ACCTCGACGC GCACGACGAC
ACCCTTACGC TTCAAGCGAC CGACCTCGAG CTGTCCATCC AGTACTCGGT TGCGGCGCTC
ATCGAGGAGA CGGGCAAGGC CGTGGTTCCC GGCAAGCTGT TCTCCGAGAT CGTGAAGAAC
CTGCCCGACG CAGCCGTGCA CGTGGCAGCC GAGGACGACT CGGCGGTCAT CACGTGCGAC
ACGGCGTCGT TCTCCATCAA GACGCTCGAC GCGGAGGACT TCCCCGGATT CCCTCACGTG
GACGTGCAGC AGGAGGTGTC CATCCCCTTC ACGCAGTTCG CTTCCATGGT GAAGCGCGTG
GCGCGCGTGG TGTCGAAGGA CGAGAGCCGC GCCATACTCA CGGGCGTGCT GATCACGCTC
GAGGACACCA CGCTCAAGAT GGTGGCCACC GACTCGTATC GCCTCGCCAT CACCGAGGCC
GAGCTTCCCG AATCCAGCGC CGAGGAGTTC CAGGCGGTCA TCGCGGGATC GTTCTTGCAG
GAGATATCTT CGCTGCCGCG CTCGGAGGAC GACCTCAAGC TGGCGCTGGC TGAGAACCAG
ATCGTCGTCA CGTACCATGA CACCGTATTC ATCAACCGCC GCCTGGAAGG CAACTTCCCG
AACTACCGCC AGCTGCTGCC CGATTCCTAC GCCACGCGCG TGAGCATGGA CGTGGGGCAC
CTCGTGGCCG GCGTGAAGCG CACGTCGCTG CTGGGGCAGA CCAGCTCGCC GGTGCGCTTC
GCCATCAACA TGGCGTCGCA GACCGTGCAG CTGTCCGCCG TGGCGCAGGA CGTGGGCTCG
GCCCAGGAGA CGCTGTCGTG CGAGGGCGAA GGCGAGGACG TGGAGATCGC TTTCAACTAC
GCCTACGTGT TGGACGGGCT GTCCTCGGTG AGCACCGACA ACGTGTTCCT CGAAGTGCAG
TCGTCCATGA AGCCCGGCAT CTTCAAAGCT GACGAGGGCG AGAACTTCCT GTACCTCGTC
ATGCCCGTGC GCATCGCGTA A
 
Protein sequence
MKFSINQSEL QNALSVVLKG IATRSTLPIL SGIYLDAHDD TLTLQATDLE LSIQYSVAAL 
IEETGKAVVP GKLFSEIVKN LPDAAVHVAA EDDSAVITCD TASFSIKTLD AEDFPGFPHV
DVQQEVSIPF TQFASMVKRV ARVVSKDESR AILTGVLITL EDTTLKMVAT DSYRLAITEA
ELPESSAEEF QAVIAGSFLQ EISSLPRSED DLKLALAENQ IVVTYHDTVF INRRLEGNFP
NYRQLLPDSY ATRVSMDVGH LVAGVKRTSL LGQTSSPVRF AINMASQTVQ LSAVAQDVGS
AQETLSCEGE GEDVEIAFNY AYVLDGLSSV STDNVFLEVQ SSMKPGIFKA DEGENFLYLV
MPVRIA