Gene Information |
Locus tag | Arth_1300 |
Symbol | |
ID | 4446189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1459104 |
End bp | 1460459 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639689108 |
Product | putative nitrilotriacetate monooxygenase |
Protein accession | YP_830794 |
Protein GI | 116669861 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
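The coordinate and length fields above agree with each other, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the stop codon; both assumptions fit the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1459104, 1460459)
print(length)                     # 1356
print(protein_length_aa(length))  # 451
```

Both results match the Gene Length (1356 bp) and Protein Length (451 aa) rows of the record.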
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0896409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCCC ACCAGCGCAC CCTGCATCTC AACGCCTTCC TCATGAGTAC CGGCCACCAC GAAGCGTCCT GGCGGCTGCC CGAAAGCAAC CCCCGGGCGA GCACGGACAT CACCCACTAC CGCAATCTCG CCCGGATCGC CGAGCGGGGC ACGTTCGATT CCATCTTCTT TGCCGACTCG CCCGCGATCT TCGGAAACGT GGGCCGCCGG CCGGCCGGAA AGCTCGAACC GACAGTGCTG CTGACCGCCA TCGCCGCAGC CACCGAGAAG ATCGGCCTCA TTGCCACGGC CTCCACCACG TACAACGACC CGTTCAACCT CGCGAGGCGC TTTGCATCCG TCGACTTCGT CAGCGCGGGC CGTGCCGGCT GGAACGTCGT CACTACCGCC GGGCCCGACG CCGCCCGGAA CTTTGGCGTC GACGACCAGC CCGCGCACGC AACGCGCTAT GAGCGGGCGG CAGAGTTCCT GGACGTCGCC GGCAAACTCT GGGACAGCTG GGACGATGAC GCCGTGGTGG CGGACAAGGA GGCGGGCGTC TGGGCCGACC CGGACAAGGT CAGGCCGATC GACCACGTCG GAAAGCATTT CCGGGTTCGC GGGCCGCTCA ACGTTCCGCG GTCCCCGCAG GGATACCCGC TGATTGTCCA GGCGGGCTCC TCCGAAGACG GCAAGGGCCT CGCCGCGCGC TACGCCGAAG CCGTGTTCAC GGCGCAGCAG ACGCTCGAGG ACGCGCAGAG CTTCTACAGC GACCTCAAGG CCCGCACCGC AGCCGCCGGA CGAGACCCTG AAGGCATCAA GATCCTGCCC GGAATCGTGC CCGTGCTGGC AGGGACAGAA GCCGAGGCGA AGAAGCTGGA GCGTGAACTC GACGAGCTGA TCCGTCCGGA ATATGCGCGG ATAGAACTTG CCAAAACCCT CGGTGTCAGC CCGGACGACC TCCCGCTGGA CCGGCAACTG CCGGCAGACC TTCCCGATGA GGATTCCATC CAGGGTGCCA AGAGCCGCTA CACCCTGATC GTGGAGCTCG GCCGTCGCGA ACGGCTCACA GTGCGGCAGC TGATCGGCCG CCTGGGCGGC GGCCGCGGAC ATCGCACGTT CTCGGGTACT CCGGAGCAGG TGGCCGATGC CATCCGGCTC TGGTTTGAAA ACGGCGCGGC CGACGGCTTC AACATCATGC CCGCGGTGCT CCCCTCAGGG CTTGAGGCCT TTGTGGACCA CGTGGTGCCG GTCCTGCGCC GGCGGGGCCT GTTCCGGACG GAATACACAG CCGACACGCT GCGGGGGCAC TACGGACTGG AGCGCCCGCT CAACCGGTAC TCCGGTGCTG CCGGACTGGC TGGCGCGCAC GCCTAA
|
Protein sequence | MASHQRTLHL NAFLMSTGHH EASWRLPESN PRASTDITHY RNLARIAERG TFDSIFFADS PAIFGNVGRR PAGKLEPTVL LTAIAAATEK IGLIATASTT YNDPFNLARR FASVDFVSAG RAGWNVVTTA GPDAARNFGV DDQPAHATRY ERAAEFLDVA GKLWDSWDDD AVVADKEAGV WADPDKVRPI DHVGKHFRVR GPLNVPRSPQ GYPLIVQAGS SEDGKGLAAR YAEAVFTAQQ TLEDAQSFYS DLKARTAAAG RDPEGIKILP GIVPVLAGTE AEAKKLEREL DELIRPEYAR IELAKTLGVS PDDLPLDRQL PADLPDEDSI QGAKSRYTLI VELGRRERLT VRQLIGRLGG GRGHRTFSGT PEQVADAIRL WFENGAADGF NIMPAVLPSG LEAFVDHVVP VLRRRGLFRT EYTADTLRGH YGLERPLNRY SGAAGLAGAH A
|
| |
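The gene sequence is shown in coding orientation (it begins with ATG and ends with the stop codon TAA), although the gene lies on the minus strand of the replicon. A minimal sketch of how the sequence rows relate to the GC content and protein sequence fields follows. It assumes the sequence is pasted as space-separated blocks of ten, as displayed here. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares. Table 11 differs in its permitted start codons, which this sketch does not model, and that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGGCATCCC ACCAGCGCAC CCTGCATCTC"
print(translate(first_blocks))    # MASHQRTLHL
print(gc_fraction("ATGGCATCCC"))  # 0.6
```

The translated prefix matches the first block of the protein sequence row. Running `gc_fraction` and `translate` on the full gene sequence would give values to compare against the GC content and Protein Length fields.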