Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lcho_2042 |
Symbol | |
ID | 6161950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Leptothrix cholodnii SP-6 |
Kingdom | Bacteria |
Replicon accession | NC_010524 |
Strand | + |
Start bp | 2219206 |
End bp | 2220282 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641664811 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001791074 |
Protein GI | 171058725 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0443757 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGATC TGTTCGAAAA CCCGATGGGC CTGATGGGCT TCGAGTTCGT CGAGTTCGCC TCGCCCACGC CGGGCGCGCT GGAGCCGGTG TTCGAGATGC TGGGCTTCAC GAAGGTGGCC ACGCACCGCT CGAAGAACGT GGTGCTGTAC CGCCAGGGCG GCATCAACTT CATCGTCAAC AACGAGCCGC GTAGCGCGGC CGCGTACTTC GCCGCCGAAC ACGGCCCTTC GGCCTGCGGC ATGGCGTTCC GGGTGCGCGA CTCGCACAAG GCCTACGCCC GCGCGCTCGA ACTCGGCGCC CAGCCGGTGG AGATCCCGAC CGGCCCGATG GAGCTGCGCC TGCCGGCGAT CAAGGGCATC GGCGGCGCGC CGCTCTACCT GATCGACCGG TTCGAAGAAG GCAAGAGCAT CTACGACATC GACTTCGATT TCCTGCCCGG CGTCGACCGC CACCCGGCGG GCCACGGCCT CAAGCTGATC GACCACCTGA CGCACAACGT CTACCGCGGC CGCATGGCGT ACTGGGCGGC GTTCTACGAA CGGCTGTTCA ACTTCAGGGA GCTGCGCTAC TTCGACATCA AGGGCGAATA CACCGGCCTG ACGAGCAAGG CCATGAGCGC GCCCGACGGC AAGATCCGCA TCCCGCTGAA CGAGGAATCG TCGCGCGGCT CGGGCCAGAT CGAGGAGTTC CTGATGCAGT TCAACGGCGA AGGCATCCAG CACATCGCCC TCTACACCGA CGACCTGCTC GGCACCTGGG ATTCGCTCAA GAAGGCCGGC CTGCCCTTCA TGACCGCCCC GCCGGCCACC TACTACGAGA TGCTCGAAGG CCGCCTGCCC GGCCACGGCG AGCCGGTGGG CGAACTGCAG GCGCGCGGCA TCCTGCTCGA CGGCAGCAGC ACGCCCGGCG ACCAGCGCCT GCTGCTGCAG ATCTTCTCGC AGACGCTGCT CGGCCCGGTC TTCTTCGAGT TCATCCAGCG CAAGGGCGAC GACGGTTTCG GCGAGGGCAA CTTCAAGGCA CTGTTCGAGT CCATCGAGCG CGACCAGGTG CGCCGCGGCG TGCTGGAGGC GGCATGA
|
Protein sequence | MSDLFENPMG LMGFEFVEFA SPTPGALEPV FEMLGFTKVA THRSKNVVLY RQGGINFIVN NEPRSAAAYF AAEHGPSACG MAFRVRDSHK AYARALELGA QPVEIPTGPM ELRLPAIKGI GGAPLYLIDR FEEGKSIYDI DFDFLPGVDR HPAGHGLKLI DHLTHNVYRG RMAYWAAFYE RLFNFRELRY FDIKGEYTGL TSKAMSAPDG KIRIPLNEES SRGSGQIEEF LMQFNGEGIQ HIALYTDDLL GTWDSLKKAG LPFMTAPPAT YYEMLEGRLP GHGEPVGELQ ARGILLDGSS TPGDQRLLLQ IFSQTLLGPV FFEFIQRKGD DGFGEGNFKA LFESIERDQV RRGVLEAA
|
| |