Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0074 |
Symbol | |
ID | 5668499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 91442 |
End bp | 92632 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239002 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001504447 |
Protein GI | 158311939 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGCA GCGAGCTCAT GGACCTCACC CGGCGCGCGC TCAAAGTCGC CACGGACCGG ACAACCGACA TGGAGCCGGC TGAACGCCGT CAGCCGGCCG ATGCCTACAC CAGCCAGGAA AGTTTCGAAC GCGAACGCGA ACTCGTCCTC CGCTCGCCAC AGCTGGTCGG CTACCGCTCG GAGCTGCCGA CCGCCGGAAG TTTCTGCACG AAGACGGTGA TGGACGTTCC CGTACTCCTG ACCCGGAGCC AGGACGGCAC CGTCAGGGCC TTCCACAACA TCTGCGCTCA TCGCCAGGCA CCGGTCGCGG TGCGCTGCGG CACAGCCGAA CGGTTCGTGT GCCCGTACCA CGCCTGGACG TACGACACAC AGGGCCGCTT CGTCGGAGGA CCCGGCCGTG AGGGTTTCCC GTCGCTGACG GCTGGCGGAA GCGGCCTCAC GGAACTGCCG GCCGCGGAGC ATGCCGGGTT TCTGTGGGTC GGCCTCCAGC CGGAGAACGG GCCCCTGGAC ATCGAGGCCC ATCTGGGGCC GCTCGGCCCG GAGCTTGCCT CGTGGGGGAT CGGCGACTGG TCGCTGGTGG GCGAGCGGGT ACTCGACTCT CCGATCAACT GGAAGCTCGC TCTGGACACC TTCGCCGAGA GCTATCACTT CTCCACCCTG CACCGCGGCA CGTTCGCCCA GCTCGCCCTG GGAAACTGCG CGCTTTTCGA CTCGTTCGGC CCGCACCATC GGCTGGTCGT TCCGTTGCGG CACATCACGC GCCTCACGGA CCTTCCCGCC GAAGAGTGGA AGCCGCTGGA CAACCTGTCG ATCGCTTATG CGCTGTTTCC TAACATCGTC GTCTCGGTGA GTGCCGCCAA CAGCGAGGTC TTCCGGATCT ACCCCGGCAG CGGGCCCGGG CACTCGGTGA CCTGCCACCA GAACGCCTCC GGGCTGGACC TCGAAGACGA GACCACCCGG GCGGGTGCGG AGAGACTGTT CGACTTCGCG CACTCGACAG TCCGCGACGA GGACTATCAG CTCGCTGTCG AGATCCAGAA GAACCTTTCC TCGGGCGCGC GATCCGAACT GGTCTTCGGG CGCAACGAGC CGGGTCTTCA GCACCGTCAC TTCGTCCTCG ACGCAAGGAT GGGCCGTCAG CCGCGGCCAG CCGGGCAGCC AGGGCGAGTG GACATTGTTC ATCGTTCATG A
|
Protein sequence | MDRSELMDLT RRALKVATDR TTDMEPAERR QPADAYTSQE SFERERELVL RSPQLVGYRS ELPTAGSFCT KTVMDVPVLL TRSQDGTVRA FHNICAHRQA PVAVRCGTAE RFVCPYHAWT YDTQGRFVGG PGREGFPSLT AGGSGLTELP AAEHAGFLWV GLQPENGPLD IEAHLGPLGP ELASWGIGDW SLVGERVLDS PINWKLALDT FAESYHFSTL HRGTFAQLAL GNCALFDSFG PHHRLVVPLR HITRLTDLPA EEWKPLDNLS IAYALFPNIV VSVSAANSEV FRIYPGSGPG HSVTCHQNAS GLDLEDETTR AGAERLFDFA HSTVRDEDYQ LAVEIQKNLS SGARSELVFG RNEPGLQHRH FVLDARMGRQ PRPAGQPGRV DIVHRS
|
| |