Gene Information |
Locus tag | Francci3_2465 |
Symbol | |
ID | 3905077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2905915 |
End bp | 2907402 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637879795 |
Product | tryptophan halogenase |
Protein accession | YP_481561 |
Protein GI | 86741161 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
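
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2905915, 2907402)
print(length)                  # 1488
print(protein_length(length))  # 495
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.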
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.842716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.17253 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACAGATT CTGCGGAATT TGATGTGGTG GTCGTTGGCG GCGGACCCGC CGGCTCCACA CTGGCCGCGC TGGTGGCCAT GCAGGGGCAT CGAGTCCTTG TCCTGGAGAA GGAGCACTTT CCGCGCTACC AGATCGGCGA GTCGCTGCTA CCATCCACTA TCCACGGGGT CTGCCGGCTG ACCGGCGCCG CCGACGAACT GGCCAAAGCC GGCTTCCCGC TCAAGCGCGG CGGTACCTTC AGATGGGGGG CCACCCCGGA GCCGTGGACG TTCGCCTTCT CGGTGTCGTC GCGGATGGCT GGGCCGACCT CATTCGCCTA TCAGGTTGAA CGGTCGAAAT TCGACGAGAT TCTACTGCGG AACGCCCGCC GGGTCGGCGC CGAGGTACAC GAGGGCTGCT CGGCCACCGA CGTCATCGAG GACGGCGACC GGGTCGTCGG CATCCGCTAC ACCGACGACG GCGGCAACCG GCGTGAGGCG CGGGCCTCCT TCGTGGTCGA CGCCACCGGC AACAAAAGCC GCATCTACCA TCGGGTTGGT GGCACCCGGC AGTACTCGGA GTTCTTTCGC AGCCTGGCCC TGTTCGGCTA CTTCGAGGGC GGCCGGCGGA TGCCCGAGCC CAACCGGAAC AACATCCTGT GTGTGGCCTT CGACAGCGGC TGGTTCTGGT ACATCCCACT GAGCGACACG CTGACCAGCG TCGGCGCGGT CGTACGGTCG GAAATGGCGG AGAAGGTCCA GGGTGACTCC GAGCAGGCCA TGAAGGCGCT CATTGAGGAG TGCCCGATGA TTTCGGATTA CCTCGCGCCG GCCAGGCGGG TCACCACCGG GCAGTACGGC CAGCTCCGGG TACGCAAGGA CTACTCCTAT CATCAGACGA CTTTCTGGCG TCCCGGGATG GTTTTGGTCG GCGACGCCGC GTGCTTTGTG GACCCGGTGT TCTCCTCCGG CGTGCACCTC GCGACCTACA GCGCGCTGCT CGCGGCCCGG TCCATCAACA GCGTCCTCGC CGAGATTGTG GACGAGAAGA CCGCGATGCA GGAGTTCGAG GCCCGCTACC GCCGAGATTA CGGCGTGTTC TACGAGTTTC TGGTGTCGTT CTACGAGATG CATCACAGCG AGGACTCCTA CTTCTGGCAG GCCAAGAAGG TCACCGGGAA CAGCCAGCCC GAGCTGGAGG CATTCGTCGA GCTGATCGGC GGAGTGTCGT CGGGGGAATC CGCGCTGACC GACGCCGACG CCCTGGCCCT CCGGCTGCAG GCCAACACCG CCGACTTCAC TACCGCGGTC GACGCGCTCG TGGCCAACAA CAGCGAGAGC ATGGTGCCGT TCATGAAGTC GCAGGTGATC CGCGGGGTCA TGCACGAGGG CTCGCAGATG CAGATGCGCG CGCTGCTCGG TGAGGACGCC GAGCCGGAGA CCCCGCTGTT CCCCGGCGGT CTGGTCTCGT CGGCTGACGG CATGTTCTGG CTGCCCACCG ACGCTTAG
Protein sequence | MTDSAEFDVV VVGGGPAGST LAALVAMQGH RVLVLEKEHF PRYQIGESLL PSTIHGVCRL TGAADELAKA GFPLKRGGTF RWGATPEPWT FAFSVSSRMA GPTSFAYQVE RSKFDEILLR NARRVGAEVH EGCSATDVIE DGDRVVGIRY TDDGGNRREA RASFVVDATG NKSRIYHRVG GTRQYSEFFR SLALFGYFEG GRRMPEPNRN NILCVAFDSG WFWYIPLSDT LTSVGAVVRS EMAEKVQGDS EQAMKALIEE CPMISDYLAP ARRVTTGQYG QLRVRKDYSY HQTTFWRPGM VLVGDAACFV DPVFSSGVHL ATYSALLAAR SINSVLAEIV DEKTAMQEFE ARYRRDYGVF YEFLVSFYEM HHSEDSYFWQ AKKVTGNSQP ELEAFVELIG GVSSGESALT DADALALRLQ ANTADFTTAV DALVANNSES MVPFMKSQVI RGVMHEGSQM QMRALLGEDA EPETPLFPGG LVSSADGMFW LPTDA
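
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is given 5' to 3' on the coding strand, builds the codon table from the NCBI translation table 11 amino-acid string, and translates an initiating alternative start codon (here GTG) as methionine. The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code; it differs
# in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces that split the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating start codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 18 and last 12 bases of the gene sequence in this record.
print(translate_cds("GTGACAGATT CTGCGGAA"))  # MTDSAE
print(translate_cds("ACCGACGCTTAG"))         # TDA
```

The two fragments reproduce the first six residues (MTDSAE) and the last three residues (TDA) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.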