Gene Information |
Locus tag | Avin_29430 |
Symbol | |
ID | 7761845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3032595 |
End bp | 3033905 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643805817 |
Product | nitrilotriacetate monooxygenase |
Protein accession | YP_002800085 |
Protein GI | 226945012 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
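The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(3032595, 3033905)
print(length, protein_length_aa(length))  # 1311 436
```

Under these assumptions the computed values match the Gene Length (1311 bp) and Protein Length (436 aa) fields.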
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.64384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCACCTGA ATCTTTTCGT TCACGGTCGC GGCCATCACG AAGCCTCCTG GCGCCATCCG CTCGGTACAC GGCAACGGCT GACCGACCTC GACTACTACA AGCATCTGGC GAGCGTTGCG GAGAAGGGGC TGCTCGACTC GCTGTTCCTG GCCGACGCCC TGTCGATGGA TGGCAGCATC CGCCATGTGG CGACCGGCGG ACTCGAACCC ATCACCCTGG TAGCGGCCCT GGCCGGCGCG ACCTCGAACA TCGGTCTGGT CGCCACGGCC TCGACCACCT ACACCGAACC CTTCAACCTG GCCCGTCAGT TCGCTTCCCT GGACCATATT TCCAATGGCC GCATCGGCTG GAACATCGTC ACTTCGTGGA CACAGGGCGC GGAAGCGAAC TTCGGCTTCG AACAGCAGCC GCCGCACGCC GAACGCTATG CACGAGCCTT CGAATTCCTC GAAGTGGTCA CCGGCCTCTG GAACAGTTGG TCCGACAACG CCATCGTCGA CGATCCAGCC AGCGGCCTGT TCCTCGAACC CTCGCTCATC CGCGCCATCG ATCACCAGGG CGCCCACTTC CGCGTCAAAG GCCCGCTGAA CGTGCCGCGC TCGCCCCAGG GCTACCCGGT GCTGTTCCAG GCCGGCTCTT CCGCCGGGGG CCAGCGTTTC GCCGCCCGCT ACGCCGAGGC GGTATTCACC GCGCAGCCGG ATCTTGCCTC CGCGCAAGCC TTCTACCGAA GCCTCAAGGA GCAGACGGTC GCCGCCGGCC GGCGCAAGGA AGACATCGCC ATCCTGCCGG GAATCAGCCC GGTCATCGCC GCCAGCGACC GCGAGGCCGA TGCCCTGTGG CGAGAGCTGA ACGAGCTGAC GGCCGTGGAA ACCGGACTGG CCCGCCTGTC GAATCGTTTC GGCGGCCACG ACTTCAGCCA TCTGCCGCTG GACCGGCCGC TCAGCGTCGA CGACTTTCCC GATCCGCACG GGGTGCAGGC GGCGCAAAGC CGTGCCGTGG TCATCACCGA TCTGGTCCGG CAACAGCGGC CGACGCTGCG CGAGCTGCTG CACCGGTTGG CCGGCGCCCG CGGCCATTTC ACCCTGGCCG GCAGCCCGGA GCGGATCGCC GACACCATCC AGACCTGGTT CGAAGAAGGC GCCGCGGACG GCTTCAACCT GATGCCGCCC ATCCTGCCGG CCCTGCTGGA GACCTTCGTC GAAGAGGTCG TCCCGCTGCT GCAGAAGCGC GGGCTGTTCC GCACCCACTA CGAAGGCACG ACGCTGCGCG ATCGCTATGG CCTGAAACGA CCGTCCAATC CCTATTTCTG A
|
Protein sequence | MHLNLFVHGR GHHEASWRHP LGTRQRLTDL DYYKHLASVA EKGLLDSLFL ADALSMDGSI RHVATGGLEP ITLVAALAGA TSNIGLVATA STTYTEPFNL ARQFASLDHI SNGRIGWNIV TSWTQGAEAN FGFEQQPPHA ERYARAFEFL EVVTGLWNSW SDNAIVDDPA SGLFLEPSLI RAIDHQGAHF RVKGPLNVPR SPQGYPVLFQ AGSSAGGQRF AARYAEAVFT AQPDLASAQA FYRSLKEQTV AAGRRKEDIA ILPGISPVIA ASDREADALW RELNELTAVE TGLARLSNRF GGHDFSHLPL DRPLSVDDFP DPHGVQAAQS RAVVITDLVR QQRPTLRELL HRLAGARGHF TLAGSPERIA DTIQTWFEEG AADGFNLMPP ILPALLETFV EEVVPLLQKR GLFRTHYEGT TLRDRYGLKR PSNPYF
|
| |
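The gene sequence begins with TTG while the protein begins with M, which is expected under translation table 11, where TTG is one of the permitted start codons. The sketch below illustrates this and shows one way to compute a GC fraction from the space-grouped sequence text. The helper names are hypothetical, the codon assignments are those of the standard genetic code (which table 11 shares), and the start codon set is the one listed for NCBI table 11; a library such as Biopython would be the usual choice in practice.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments and differs only in its start codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCACCTGA ATCTTTTCGT TCACGGTCGC"))  # MHLNLFVHGR
```

Passing the full Gene sequence field to `translate_cds` and `gc_fraction` would allow the result to be compared with the Protein sequence and GC content fields.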