Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2379 |
Symbol | |
ID | 8416703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2797116 |
End bp | 2798174 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645025363 |
Product | 6-phosphogluconolactonase |
Protein accession | YP_003182726 |
Protein GI | 257792120 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2706] 3-carboxymuconate cyclase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.799827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCG ATGCCGCTCC CGCCCTGCGG CGGCTGTTCG TCGGCAGCTA CACCGACGGC GCCGATCGCG GCGGTATCCG CACGATCGCG TTCGACGAAA GCGGCGACGA GGCGCACGTC GTGCGCGCGG ACAGCTGCGG GGCGAACCCA ACCTATCTGG CGCAGCGCGG ATCGCTGCTG TTCGCCGCTC ACGAGCTGGA CTCCTGCGGG CGGATGGCCG CCTACGCCAT CGAGCCGGAC GGCTCGCTGA CCTGCCGCGG AGCGTGCACG GCGCCCTACG ATGCGGGCAC GTGCTTCGTG CTGCCCGATC CGAACGGACG CAACCTCTTC GGGGCGAACT ACTTGAGCGG ATCCGTCGCC TGCTGCGCGC TGCTGGACGA CGGGCGTCTG GCGGCGGGCG TGCCGTCGCG GCGCCACGAA GGGCGCGGTC TGCGGGACGA TCGGCAGGAG GGGCCCCATG TGCACTCGCT GAGCTTCGTG CCGGGAACGC GGCTGCTGGT CGCCGTGGAC CTGGGGCTCG ACGCGCTTGT GATCTACCAG GTTGACGCGT GCGGAATGCT CGCGCCGACG GCTGCGGAAA CCGTGCGCGT GCCGGCAGGG TCGGGGCCGC GCATGGTGGC GTACCACCCG CGTCTGCCGA TGGCCGCGCT GGTGAACGAG CTGGCGTGCG ACGTGCTGGT GTTCCGGATC GACGAAGGCG GGCTGCATTG GCGGATCGTC GAGCAGCTGA GTCTGCCGCA GGCGCCAAGC GGCGACGCGC TGGCGGCGCA TATCGCGTTC TCGCCCGACG GGCGGCAACT GTACGCGTCG GTGCGCGGAA GCGACCAGCT GGTCGTCTTC CCGGTGGACG ACCAAGGACG GGTTGCGGGG CGCTGCGACG TTGCGTCCGG AGGGAAGGGG CCGCGGCACT TTTCGCCGTC GCCCGACGGG CGCTTTCTGG CCGTCGCCAA CCTCGCGAGC GACGACGTGC GCCTGTTCGA ACGCGATGCC GACGGGATGC TGCGAGCGGT CGCGTGCGTG GACGTGCCAC AACCGGCGTG CGTCATCTGG AACGCGTAA
|
Protein sequence | MSADAAPALR RLFVGSYTDG ADRGGIRTIA FDESGDEAHV VRADSCGANP TYLAQRGSLL FAAHELDSCG RMAAYAIEPD GSLTCRGACT APYDAGTCFV LPDPNGRNLF GANYLSGSVA CCALLDDGRL AAGVPSRRHE GRGLRDDRQE GPHVHSLSFV PGTRLLVAVD LGLDALVIYQ VDACGMLAPT AAETVRVPAG SGPRMVAYHP RLPMAALVNE LACDVLVFRI DEGGLHWRIV EQLSLPQAPS GDALAAHIAF SPDGRQLYAS VRGSDQLVVF PVDDQGRVAG RCDVASGGKG PRHFSPSPDG RFLAVANLAS DDVRLFERDA DGMLRAVACV DVPQPACVIW NA
|
| |