Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1080 |
Symbol | |
ID | 8415370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1306099 |
End bp | 1307190 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645024043 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_003181440 |
Protein GI | 257790834 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.463173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000000000154847 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTAGCG TGTGCGCGGC CGCGCCCCAG CTGGCGGGCG TGGTGCCCTA CGATCCGAAG TATTTGCCCG TCACCGCCGT CCTCTCGGCG AACGAGAACC CGCACGACGT CGACGACGAG ATCCGCCGCG ACATCATGCG CGAGGTGAAG CGCCTGCCCC TCAACCGCTA CCCGGACCCG CTGGCCAACG ACCTGCGCGA CATGATCGCC GAAGCGAACG GGCTCGACCG CGACCAGGTG CTGGTGGGCA ACGGGGGCGA CGAGCTCTTG TTCAACCTGG CGCTGGCGTG GGGCGGGCCG GGGCGCACGT TCCTCAACCT GCCGCCCACG TTCTCGGTGT ACGACGCGAA CGCCCGCCTC ACGAACACGT CGGTGGTGGA CGTGCCGCGC CGCGCCGACT TCTCCATCGA CGAGGAGGCC GTGCTGGCGC GCGTCGCCGA GGGCGGTATC GACTACCTGG TGGTGACCAG CCCGAACAAC CCCACGGGGC AGCTTGCCAG CGAGACGTTC ATCCTCCGGC TGCTCGACGC CACCGATGCG CTCGTGATGG TGGACGAGGC TTACTTCGAG TTCTCGCGCC AGACGATGCG CCCGTATCTG GCGCAGCACA AGAACCTCGT CATCCTGCGC ACGTTCTCGA AGGCGTTCAG CCTGGCCGGG GCGCGTATGG GCTACATCCT GGGAGACGCC GAGGTCGTGC GCGAGTTCGT CAAGGTGCGC CAGCCGTATT CGGTGGACGC CGTCTCGCAG GCCGTTGCGC GCGTGGTGTA CGCGAACCGC GCGAAGTTCG AGCGCGGCAT CCTGGCCGTC ATCGAGGAGC GCGCCCGCCT GATCGAGGGA CTGAAGAGGA TTCCCGGCGT GAAGCCCTTC CCGTCGGATG CGAACTACGT GCTGTTCCGC GTGGAGAACG CGCCCGTCAT CTGGGAGGCG CTGTACGAGC GCGGTGTGCT TGTGCGCGAT TTCTCGCGTG CGGCGCATCT GGAGAACTGC CTGCGCGTGA CCGTGGGCGC CTCCGAGGAG AACGACGCGT TTTTGCGCGC GCTGCGCGAT GCGGTGATGG GCAAGTGCGA TTTGAAGGTT CCGTCGACCT GA
|
Protein sequence | MRSVCAAAPQ LAGVVPYDPK YLPVTAVLSA NENPHDVDDE IRRDIMREVK RLPLNRYPDP LANDLRDMIA EANGLDRDQV LVGNGGDELL FNLALAWGGP GRTFLNLPPT FSVYDANARL TNTSVVDVPR RADFSIDEEA VLARVAEGGI DYLVVTSPNN PTGQLASETF ILRLLDATDA LVMVDEAYFE FSRQTMRPYL AQHKNLVILR TFSKAFSLAG ARMGYILGDA EVVREFVKVR QPYSVDAVSQ AVARVVYANR AKFERGILAV IEERARLIEG LKRIPGVKPF PSDANYVLFR VENAPVIWEA LYERGVLVRD FSRAAHLENC LRVTVGASEE NDAFLRALRD AVMGKCDLKV PST
|
| |