Gene Information |
Locus tag | Francci3_2065 |
Symbol | |
ID | 3904638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2428607 |
End bp | 2429926 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637879401 |
Product | tryptophan halogenase |
Protein accession | YP_481167 |
Protein GI | 86740767 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
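The length fields in this table can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which the values in this record imply) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(2428607, 2429926)
print(length, protein_length_aa(length))
```

With the coordinates listed here, the sketch gives 1320 bp and 439 aa, matching the Gene Length and Protein Length fields.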
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.215345 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGAGA ATACTGAAGT CGTCGACGTT GTGGTCATCG GCGCCGGGCC GGCAGGTGCG GCGGCGGCAG CGACGCTCGC GAAGGCCGGG CGTAGCGTGA TTGTGTTGGA ACGGCGGACC TTTCCCCGTT TTCACATCGG CGAGTCCATG GTGCCGTTCG TCAACGCCGC GTTCGAGAAG CTGGGAATAC TCGACCGGCT CAAGCAGCAG GGTTATGTCG CCAAGCATGG TGCCGAGTTC TGCCAGGGAG AGGAGGACGA GAACGTCCGC GTTTCGTTCA CAACGCAGGG GCCCGGTCGC CACCACGTGA CGTTCCAGGT AGAGCGGGCG CACCTGGACA ATATGCTCAT TCAGTTCGCG GGCGAGTGCG GTGCGCGGGT GATCCACGAG GCGACGGTAC ACGACCTGAT AACAGAAGGC GACCGGGTCG TCGGTGTGCG CTACGAACAC GACGGGACGG CTCGCGAGGT GCGGGCACAG TACGTTCTCG ACGCCGGCGG GCGGGCCAGC AAGATCGCCA AGGCGTTCCG GCTGCGGAAG CCGGTCGACC GACTGAAGAT GGTCGCGGTC TTCCGCCACC TCAAGGGCAT CGACGAGGCG CGCAATCCCG GCTTCGAGGG CGACATCCAG GTGGGCGCCC ACGAGGACGG ATGGCTGTGG GCCATTCCCA TCTGGCCCGA CACGATGAGC GTCGGCGCGG TCATGCCGCA ACAGGTGCTG CGGTCCGGCG ATCCCGCCGC GCTGTTCGAC GAGCACGTGT CCCGGGTACG GCGCGTCCGG GAGCGGGTCG CGGGCGCCCA TCCGGTGAGC GACGTGCAGA TCGAAACCGA CTACTGCTAC TACTCGGACC AGGTCGCGGG GCCGGGCTGG TTCCTAGCCG GCGACGCGGG CTGCTTCTTC GACCCGATCT TCTCCGGCGG TGTCTACCTC GCCACCTCCA CCGGCATCCG CGCCGGCGAG TCCATCGACG CCGCCCTGCG GGAGCCGCTC CGCGCCGAGG AGCTCCAGAA CGAGTACCAG CGGTTCTACA AGACCGGCTA CGACATGTAC GCCCGGCTGA TCTACATGTA CTACGAGGAG CCGGATCCTG ACGCCTACCT GGCGTCCGTC GGACTCGACG ACTGCGGCGA CGCCTTCGCG AGCAACAAGT GGGTCGTCCG CTTCCTCTGC GGGGACTTCT TCAACGCCCG GAACAAGCTC GCCCAGGAGG TCGTCAAGGA GCGTCGCTGG GACACCTTCG CGCCGTTCGA GCGCGTCAGC GAGTGCCCGT ACTACGCCGA ACTGAACGAG GCGGAGGACA GGGAGCCCGT CGAGGCGTAA
Protein sequence | MAENTEVVDV VVIGAGPAGA AAAATLAKAG RSVIVLERRT FPRFHIGESM VPFVNAAFEK LGILDRLKQQ GYVAKHGAEF CQGEEDENVR VSFTTQGPGR HHVTFQVERA HLDNMLIQFA GECGARVIHE ATVHDLITEG DRVVGVRYEH DGTAREVRAQ YVLDAGGRAS KIAKAFRLRK PVDRLKMVAV FRHLKGIDEA RNPGFEGDIQ VGAHEDGWLW AIPIWPDTMS VGAVMPQQVL RSGDPAALFD EHVSRVRRVR ERVAGAHPVS DVQIETDYCY YSDQVAGPGW FLAGDAGCFF DPIFSGGVYL ATSTGIRAGE SIDAALREPL RAEELQNEYQ RFYKTGYDMY ARLIYMYYEE PDPDAYLASV GLDDCGDAFA SNKWVVRFLC GDFFNARNKL AQEVVKERRW DTFAPFERVS ECPYYAELNE AEDREPVEA
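The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup and of a GC-content calculation like the one behind the GC content field. The helper names are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence displayed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence shown above (not the full gene).
first_blocks = "ATGGCCGAGA ATACTGAAGT CGTCGACGTT"
dna = clean_sequence(first_blocks)
print(len(dna), round(gc_percent(dna), 1))
```

Running the same two functions on the full gene sequence would allow a comparison with the 67% reported in the record; the short excerpt used here is not expected to match that figure.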