Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0177 |
Symbol | |
ID | 8414461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 245994 |
End bp | 247163 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645023157 |
Product | urea amidolyase related protein |
Protein accession | YP_003180560 |
Protein GI | 257789954 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTGG TCGTCAAAAA ACCCGGTGTG ATCACCACCA TCCAGGACGT GGGGCGCTTC GGCTATCAGG GCAGCGGCTT CTCCACGAAC GGCGTGATGG ATCATCGCGC GTTCGCCATC GCGAACCTCC TGGTGGAGAA CGACCCCGGC GCACCCGTGC TGGAATTCGC GCTGGCGGGA CCCACGGTGC GCTTCACCAC GAACACCTGC TTCGCCATCA CGGGCGGCGA CTTCGCGCCG ACGCTGGACG GCGAACCCGT GCCCATGTAC GCGGCCGTCA TGGTGCGGCG CGGCTCCATC CTGCGCTTCA AGGCGTCGCG CACCGGCTGC TACGGTTACT TGGCCATCGC GGGCAACAGC GTGCGCGTGC CCGAAGTGAT GGGCAGTCGC TCGACGAACC TCAAGTGCGA GTTCGGCGGA TGGAAGGGTC GCACGCTCAT CGTCGGCGAC TACCTGCCGT TTTCCACGAA GAGCGTGGAC TTCCTGCCCA ACCTAGGGTC GCACCGCATC GACGGCGACA ACGAGTTCTA CGGCTTCGAC CGCGACGAAA TCACCGTGCG CGTGGTGCCC GGGCCTCAGC AGGACATGTT CACCGACAAA GGCCTGGCTA CGTTCTACGG GCAGGCGTAC ACCACCACCA CGAAATGCGA CCGCATGGGG TACCGCCTGG ACGGACCGGA AATCGAGACG AAGCACGGGT CCGACATCAT CTCCGACGGC GTGGCGTTCG GCGCCGTGCA GGTACCGTCG CACGGACGGC CCATCATCAT GCTGGCCGAC CGGCAAACTA CCGGCGGTTA CGCGAAAATC GGCACCATCG CCAGCGTGGA CATCCCGAAG CTGGTGCAAC GTCCGCCGGG CGGCAAGATC CGCTTCGAGT CCATCGGCGT GCAGGAGGCC CAAGCGCTGC TGCGCGAAGA GGCGCACCTG TTCGAAATGC TGGCGTTGAA AGTGAGGCGG CCCAGCGCCG ACGGCATATC GCCCCGGCGC ACGGCGCGGC GCCTGACCCC CATCCTGGAG GAACAGGCAC GCAAGTCGCA GGCCGACATG TTGTGGATCG ACCGAGCCGA TCGCTCCCAG CGCGTCGGCA ACCGCAACGC CCTGGGAACC GTCCCGCCAA AGAAACAACC GCCCGACGCA ACCGACACCA CCGAACGCGC AACGACGTAG
|
Protein sequence | MGLVVKKPGV ITTIQDVGRF GYQGSGFSTN GVMDHRAFAI ANLLVENDPG APVLEFALAG PTVRFTTNTC FAITGGDFAP TLDGEPVPMY AAVMVRRGSI LRFKASRTGC YGYLAIAGNS VRVPEVMGSR STNLKCEFGG WKGRTLIVGD YLPFSTKSVD FLPNLGSHRI DGDNEFYGFD RDEITVRVVP GPQQDMFTDK GLATFYGQAY TTTTKCDRMG YRLDGPEIET KHGSDIISDG VAFGAVQVPS HGRPIIMLAD RQTTGGYAKI GTIASVDIPK LVQRPPGGKI RFESIGVQEA QALLREEAHL FEMLALKVRR PSADGISPRR TARRLTPILE EQARKSQADM LWIDRADRSQ RVGNRNALGT VPPKKQPPDA TDTTERATT
|
| |