Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2756 |
Symbol | |
ID | 3906467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3246852 |
End bp | 3248708 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637880079 |
Product | tryptophan halogenase |
Protein accession | YP_481845 |
Protein GI | 86741445 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGATC ATTACGACGT TATTATCGCC GGCGGCGGGC CGGCGGGCTC TACACTGGCC GCGCTGCTCG CCCGCACGTC GGACCTGAAA GTAGCGATCT TCGAGAAGGA TGAGTTCCCG CGCGAGCACA TCGGCGAGTC GTTCGCGCAC CCGTTGATCC CGGTGCTGGC AGAGAGCGGA GCGCTCGCGA AGGTGCTGGC CAGCAACTGC TGGGTAAAGA AATTTGGCGG TATCTACAGC TGGGCGCGGC AGGGTCCGAG CCGGGCGTTC TTCGACCACG CGAACTGGGC GGTGGACGGG GTACACCGAT GGGCACTGCA CGTCAACCGT TCAGAGTTCG ACCAGATCCT GCTGGAACAC GCCCGGGACC TCGGAGTCGA CGTCACCACC GGGATGGCCG TCACCGACTT CGCCGCGGCC GCCAACGGCT GTCAGGTGAC GCTCGCCGAC GGCACCGCCG TGTCCGGCGC GTACTTCGTC GATGCCTCCG GTCGCCAGCA GAGCCTGGTC ACCAAAAAGC CTCGCGAATG GTTGTCCGGC TACCGGAACA TCGCGATCTG GCAGCACTAC CTGGGCGGCC TGCCGGCCCA GGGGCTCGAC GGCGACTGGA ACATCTTCCG GGAGAAGAAC CTCTCGCCGA TCGGTTGCTT CGCGTTTCCC GACGGCTGGT GCTGGTACAT CCCCGTCCCC AGGATCGTGA ACGGGGAGCG GGTGCTCACG CACTCGATCG GCATCGTGAC GAGCCCGGAG GTGCTGAAGG AACCCGGGAA GGACTTCACG GACTCCGAGG TCTTTCTGCG CACCGTCCGC GGCGTGCCGC GGCTGGCCGA CCTGGTGGCC GAGGTGACGC CGATCTCCGA CCAGATGATG ACGGTCACCA ACTACTCCCG GGTCAATGAG CGCTTCGCCG ACCTCGACCG GCACTGGATC CTGATCGGCG ATGCCTCCTA CTTCGTGGAC CCCCTCTTCT CGTCCGGGGT CGCGTTCGCG GCCAACCAGG CGGCCTCGGC GGCGCTGCTG CTGCGCACCA CGCTGCGCGC CGAACTCTCC CCGGGTCTGG TGAGAGATCT GTGGCAGGAC TACGACCACG AGTGGCACGG AATGGCCGAG GTGTTCGCGC TCTCGATTGA TCAGTGGTAT CACATGATCG GCGCGGACAA CCCGGGCAGC GCATACTGGC ACCGGCGCAA TTCGAGTCCG CATCTGGACA TGCCCGATCG GTCCTTCGAC GCGCTGCTCA ACACGGCGTT CACCCCCGAC CTGCTCCTGA TCATGACGCG CGGCACCGGC CGAATGTCCG ACCTGGCGAT CGACGGTCCG TACCAGCAGG CTCGCGCGCA CGTGATGCTG ACGGAGCCGG AACCGGACGC CGTGCTGGTC GCCGCTCCCG GAGTCCGGAT GCGGGCCGGC GTGGCGCTGG ATGTCCCCGG CTTCAAAGCG GTGCTCCCAC CGGCCGACCT CGAACTGGAC ACGCCCGCCG CGGTACGGGC CGCCGTCGCC GAGTACTGGA CCGATCCGGT GGCGGCCGAG GCGAACGGCG GCCTCGGCGT GCCTTCTCCG ACCGCCTCCC CAGTACCGTG CCACCGTTTC GAGTTCGATT CCGACGCACT CGTCGATTCC GGCTCAGGCG CGACGGGGTT TTCGGTGCGC GGGGTGGACA GTCACGACGG CGCACCGCAG CTGTGGGAGA TACTCAGCCG CGGTCCGGTC GTCTACGGCG AGCTCGGCTC GCGGCTCGCC CCCGGTCAGC GGGTGCTGCT GCAGCGGCTG ATAAAGGCCG GAATGGTCAC CGTCAAGACC GCGGCGAGGC AGACCACGCC GGGGACTGAA GCCGCCGAGG TCACCCTCGC GGACTGA
|
Protein sequence | MRDHYDVIIA GGGPAGSTLA ALLARTSDLK VAIFEKDEFP REHIGESFAH PLIPVLAESG ALAKVLASNC WVKKFGGIYS WARQGPSRAF FDHANWAVDG VHRWALHVNR SEFDQILLEH ARDLGVDVTT GMAVTDFAAA ANGCQVTLAD GTAVSGAYFV DASGRQQSLV TKKPREWLSG YRNIAIWQHY LGGLPAQGLD GDWNIFREKN LSPIGCFAFP DGWCWYIPVP RIVNGERVLT HSIGIVTSPE VLKEPGKDFT DSEVFLRTVR GVPRLADLVA EVTPISDQMM TVTNYSRVNE RFADLDRHWI LIGDASYFVD PLFSSGVAFA ANQAASAALL LRTTLRAELS PGLVRDLWQD YDHEWHGMAE VFALSIDQWY HMIGADNPGS AYWHRRNSSP HLDMPDRSFD ALLNTAFTPD LLLIMTRGTG RMSDLAIDGP YQQARAHVML TEPEPDAVLV AAPGVRMRAG VALDVPGFKA VLPPADLELD TPAAVRAAVA EYWTDPVAAE ANGGLGVPSP TASPVPCHRF EFDSDALVDS GSGATGFSVR GVDSHDGAPQ LWEILSRGPV VYGELGSRLA PGQRVLLQRL IKAGMVTVKT AARQTTPGTE AAEVTLAD
|
| |