Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3786 |
Symbol | |
ID | 5672150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4489125 |
End bp | 4490330 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242665 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_001508085 |
Protein GI | 158315577 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAG AAGTGACGCT GACCGGCCAG GAGCGCTCGG CCGAGCTGGA CGTCGACCAG CTCCGACAGC TGGTCGGTCT CGTCGAGCAC GACCCGGCCG GCGACCCGTT CCCGGTGTCC GGCTGGGACG CGGTGGTGTG GGTGGTCGGC AACGCCAAGC AGGCGGCCCA CTACTACCAG TCGGCGTTCG GCATGGACCT CGTTGCCTAC TCCGGGCCGG AGACGGGCCA ACCCGACCAT TGCTCCTACG TCCTCACCAG CGGCGCGGTG CGGTTCGTGT TCAAGGGGGG AGTGCGCCCG GACAGCCCGC TGCTGGACCA CCACCGCCGG CACGGCGACG GCGTCGTCGA CATCGCCCTG GAGGTGCCCG ACGTCGACCG GTGCATCGCG CACGCCCGCG CCCAGGGCGC CCGGGTGATC GAGGAGCCGC ACGAGCTGCG CGACGAGCAC GGCGTCGTGC GCCTCGCCGC CATCGCCGCC TACGGGCGGA CGCGGCACAC GCTGGTCGAC CGGTCGCGCT ACTCCGGGTG CTACCTGCCG GGCTACGTCG AGCGCCGCTC CGGGCACGTC CGGCGGCCGG GTGCGCCGCG CAGCCTGTTC CAGGCTCTGG ACCACGTCGT GGGCAACGTC GAGCTCGGCG CCATGGACGA GTGGGTGGCG TTCTACAACC GGGTCATGGG CTTCACGAAC CTGGCCGAGT TCATCGGCGG CGACATCGCC ACCCGGTACT CGGCGCTGAT GAGCAAGGTG GTCGCCAGTG GCAACCACCG GGTCAAGTTC CCGCTCAACG AGCCCGCGCC CGGCCGGCGG AAGTCGCAGA TAGCGGAGTA CCTGGAGTTT CACGGTGGTC CCGGCGCGCA GCACCTCGCC CTGGCGACCG GTGACATCCT CGCCTCGGTC GACGCGATGC GGGCCGGGGG CGTCGAGTTC CTCGACACCC CCGACACCTA CTACGACGAC CCCGCGCTGT GGGCCCGCGT CGGCGAGGTG CGGGCACCGG TCGAGGAGCT CCGGCGACGC CGGATCCTGG TCGACCGCGA TGAGGACGGA TACCTCCTGC AGATCTTCAC CAGGCCGCTG GGGGACCGTC CGACGGTGTT CTTCGAGCTC ATCGAGCGGC ACGGATCGCT CGGCTTCGGC AAGGGCAACT TTCAGGCCCT CTTCGAGGCG ATCGAACGCG AGCAGGAACG GCGGGGAAAC CTCTGA
|
Protein sequence | MSTEVTLTGQ ERSAELDVDQ LRQLVGLVEH DPAGDPFPVS GWDAVVWVVG NAKQAAHYYQ SAFGMDLVAY SGPETGQPDH CSYVLTSGAV RFVFKGGVRP DSPLLDHHRR HGDGVVDIAL EVPDVDRCIA HARAQGARVI EEPHELRDEH GVVRLAAIAA YGRTRHTLVD RSRYSGCYLP GYVERRSGHV RRPGAPRSLF QALDHVVGNV ELGAMDEWVA FYNRVMGFTN LAEFIGGDIA TRYSALMSKV VASGNHRVKF PLNEPAPGRR KSQIAEYLEF HGGPGAQHLA LATGDILASV DAMRAGGVEF LDTPDTYYDD PALWARVGEV RAPVEELRRR RILVDRDEDG YLLQIFTRPL GDRPTVFFEL IERHGSLGFG KGNFQALFEA IEREQERRGN L
|
| |