Gene Elen_2895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2895 
Symbol 
ID8417226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3361867 
End bp3363174 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content66% 
IMG OID645025873 
Productdihydropteroate synthase 
Protein accessionYP_003183229 
Protein GI257792623 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0294] Dihydropteroate synthase and related enzymes 
TIGRFAM ID[TIGR01496] dihydropteroate synthase
[TIGR01498] 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTGGC GCTGCGCAAC CTATGAGTTC GATACGAGAA TGCCCATCGT CATGGGCATT 
CTCAACGTTA CCCCCGACTC CTTCTCCGAC GGAGGCCAGC ACGACGGCTT CGATGCCGCG
CTGGCTCATG CCGAGCGCAT GGCGGAGGAG GGAGCCCGCA TAATCGATGT GGGCGGCGAG
TCCACGCGGC CCGGCGCCGC GCCGGTGTCC GTGGACGAGG AGCTGGCGCG CGTGCTGCCG
GTGGTGCGCG CATTGGCGCA GCGCGATGTG TGCGTGAGCA TCGATACGCG CCACGCCGAA
GTTGCGCGCG CGTGCTTGGA AGCGGGCGCG GCCATCGTGA ACGACGTGTC CGGCTTCCGC
GACCCCGCCA TGGTGGATGC GGTGCGCGAC AGCGATTGCG GGCTGGTGGT CATGCACATG
CAGGGCGACC CCTCGACCAT GCAGAACGCG CCTTCGTATG ACGACGTGGT GGCCGACGTG
CGCGAGTGGC TGCGCGACCG GGCTGCCGCT TTGGAGGCTG CGGGCGTCGC GCACGACCGC
ATCTGCATTG ACCCCGGTCC CGGCTTCGGC AAGACGCCGT CGCAGACGCT GGAGCTGGTG
CGCAACTTCC AAGAGTTCGT GCGTCTGGGC TACCCAGTGA TGGTGGCGGT GTCGCGCAAG
AGCTTCTTGG GCTGGGCGTA CGGCATCGAC GAACCTTCTG CGCGCGACGA GGTTTCGGCT
GCCGAGGCGC TCATGGCCTG CGAGCTGGGA GCCAGCGTGG TGCGCGCGCA CAACGTGGCG
GCCACGGTTG CCGCGCTCGA AGGGCTGCGG CCCTACGCGC TCATCGGCAT GGGCTGCAAC
GTCCCGCTTG TGGCCTCGCC CGGCGAGGAG CGCGAGGGCA AGATCGCCAT GCTCAACCAG
GCCATCACCG AGCTGTGTTC GCTGCCCGAC TCGCAGATCG TCGACATCTC CAGCTTTTAC
GAGAGCGAGC CGGCCTACTA CCTCGACCAA GATTCGTTCG TGAACGCCGT GGTGCTTTTG
CGCACAGGTA TTCCGCCGAA AGAGCTTCTG GGCTACCTGC ATGCGGTGGA GAACAGCCTG
GGTCGTGTGC GCGAGGTTCG GAACGGGCCG CGCACGTGCG ACCTCGATAT CCTCGACTAC
CAGCTGTACG TCGTGGATGC CGATGTGCTC ACGTTGCCGC ATCCGCGCCT GCTGGAACGC
GATTTCGTAG TGCAGCCCCT GTTGGAGCTG CTCCCTGGCC ACGTGCTTGC CAATGACGTG
CCGGTTAGCG TCGATGGGGT TACGGTGGGG AAGAGCGTGC GGCTGTGA
 
Protein sequence
MIWRCATYEF DTRMPIVMGI LNVTPDSFSD GGQHDGFDAA LAHAERMAEE GARIIDVGGE 
STRPGAAPVS VDEELARVLP VVRALAQRDV CVSIDTRHAE VARACLEAGA AIVNDVSGFR
DPAMVDAVRD SDCGLVVMHM QGDPSTMQNA PSYDDVVADV REWLRDRAAA LEAAGVAHDR
ICIDPGPGFG KTPSQTLELV RNFQEFVRLG YPVMVAVSRK SFLGWAYGID EPSARDEVSA
AEALMACELG ASVVRAHNVA ATVAALEGLR PYALIGMGCN VPLVASPGEE REGKIAMLNQ
AITELCSLPD SQIVDISSFY ESEPAYYLDQ DSFVNAVVLL RTGIPPKELL GYLHAVENSL
GRVREVRNGP RTCDLDILDY QLYVVDADVL TLPHPRLLER DFVVQPLLEL LPGHVLANDV
PVSVDGVTVG KSVRL