Gene Elen_2551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2551 
Symbol 
ID8416875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2983336 
End bp2984283 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content68% 
IMG OID645025532 
ProductShikimate dehydrogenase substrate binding domain protein 
Protein accessionYP_003182895 
Protein GI257792289 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0169] Shikimate 5-dehydrogenase 
TIGRFAM ID[TIGR00507] shikimate 5-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA CGCAAGCCGA ACGGCCGGCC GAGAGCCTGT ACCTCTTGGG ACATCCCATT 
GCGCACTCTT CGTCGCCCGC GATGTACAAC GCCGTGTACG AGCGCTTAGG ATTGCCGTGG
CGCTACGGTT TGGCTGATTG CGCGACCGAG GAAGAGGCGC GCTCGTTCGT GGAGGCGCGC
GGCTTCCTGT CCATCAACAT CACCACGCCG TACAAGCCGC TCGCGTTCGA GGCCGCCACG
GCGAAGGCTG CCACGGCGAA GCTCGCCCAG GGCGCGAACG TGCTGGTGAA GAAGGGTGAC
GCGCTCATCG GCTTCAACAC CGACGGCCAG GGCTGCGTGG CGTACTTGGA GCGCACGGGC
TTTTGCTTCG CAGGCAAGCG CGTGGCCGTG TGCGGCACGG GCCCCACGGC GCTGTCCATC
CTGCACGCAT GCGCCATCGC CGGGGCGGAC GTGGCCATGC TGGTCGGGCG CGACAAGGAG
CGCTCCCGCA AGGTGCTCGA AGGCTACGTC GAGCGGTTCG GCCTGTTGGC AAATGCCACG
GTGGACCTGC CGGCAGCGCA GGCGCACCAT CGCAGCTTCC GCACGGCTTA CGAGCGCACC
ACGTTCAAGT TCGGCAGCTA CACCACGTCC ACGAAGGCGC TGGCCGCCGC CGATCTCGTG
GTGAACGCGA CGCCGCTCGG CATGAACGAG GGCGACGGCT CGCCGTTCGA CGTCGAGCTT
CTGAGCGCGG GGCAGACCGT GTTCGACGCG GTATACGGCC ACGGCGAGAC GGCGCTCGTG
CGCGCCGCGC GCGAAGCGGG ATGTACGGTG CACGACGGCG CCGGGATGCT GGTAGCGCAG
GCGGTGGCTA CCGTGCACGC CGTGTGCGAC CTCGCCGAGG TCGACGTCGC CCTGTCTGAC
GACGAGCTGT TCGCCTTGAT GGCGGAAGCG GCAGGGTTCG ACCTGTAG
 
Protein sequence
MTDTQAERPA ESLYLLGHPI AHSSSPAMYN AVYERLGLPW RYGLADCATE EEARSFVEAR 
GFLSINITTP YKPLAFEAAT AKAATAKLAQ GANVLVKKGD ALIGFNTDGQ GCVAYLERTG
FCFAGKRVAV CGTGPTALSI LHACAIAGAD VAMLVGRDKE RSRKVLEGYV ERFGLLANAT
VDLPAAQAHH RSFRTAYERT TFKFGSYTTS TKALAAADLV VNATPLGMNE GDGSPFDVEL
LSAGQTVFDA VYGHGETALV RAAREAGCTV HDGAGMLVAQ AVATVHAVCD LAEVDVALSD
DELFALMAEA AGFDL