Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5216 |
Symbol | |
ID | 7381345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 213879 |
End bp | 215264 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643648857 |
Product | nitrilotriacetate monooxygenase |
Protein accession | YP_002547094 |
Protein GI | 222106303 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.772157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAAC GGCTTATATT CAATTGCTTC ACGATGAATG TTCCCTCGCA TATCTACCAC GGGACATGGC GACATCCGAA CAATCAACTT GCTCACTTCA ATGAGTTCGA GACCTGGTCA AAGCTGGCAA CAAAACTGGA AGAGGGGCGG TTCGACGCGA TGTTCTTCGC GGATATTCTT GGCATCGATC CGGCCTATGA TGGCAAGTGG GATGCTTATT TCGAGCAAGG CCTCCATATG CCTTGCAACG ATACCTTCAC CTTGTGCGCG GCCCTTGCAG GGGTGACGAA AAACCTTGGT CTCGTTTTCA CTAGTTCGAT CCTGTCCGAG CATCCTTTCG CGTTTGCAAA GAAGGCCTCG ACCCTCGACC ATATCAGCGG CGGCCGTATC GGTTGGAACA TTGTCACAAG CGTGACGGAC AATGCCGCCC GCAACTTCGG TTACGACAAG ATCGTCCCGC ATGACCAGCG CTACGACTGG GCGGACGAAT ATATGTCCGT TCTCTACAAG CTCTGGGAAG GGTCCTGGGA GGACGGCGCG ATGATCGCGG ACCGTGAAAG CGGCGTGTTC AGCGATCACA CCAAAGTGCA TCGGATCAAC CATGTCGGAG AGCGTTACAA GGTGCAGGGG CCGCATCTGG TCAGCCCCTC TCCGCAACGG ACGCCGATGC TCTATCAGGC CGGTGCCTCC AAGCGTGGCA GCCAGTTCGC GGCGCAGCAT GCAGAAGGAA CTTTCGTTCT CTATCCGAAT GTCGACGGTG CCCGGATCGG GATTGCTGGC ACCAAGGCGG TGGCGGCGCA GGTGGGACGC GGTGCCGAAG ACCTCAAGTT CATTCAGGGC CTGTCCTTTG TCGTCGGCAG CACGATGGAA GAGGCGGAGC GGAAGGCTGC GGAGATCGAC GAATGGGTCA GCTATGAAGG CCTTGCCGCG CATGTCAGCC GCGACATGGG TGTCGATCTC TCCAATCTCG ACCCGGACAA GCCGGTCGAT GAGTCCGGTC TCGATGGCCT GCAAGGTTAC GCACGCATGA TCGAGATGGG CAAGCCCAAT GGCGAGAAGG CCACTGTCAA GGAGGTCGCA AACGCGCTTT CCTACAATTG CCGCATTGTC GGCACGCCGG ACAGTATCGC CGACGAATTG GCGCTTTGGC AGGATGCAGG TGTTGATGGC ATCAACATGA TCTGCCAATT GCATCCCGAC ACCTATATCG ATTTCATCGA TCACGTCACG CCGGTCTTGC AGGACCGTGG CCTTGCCCAG CGCGACTATG CCGAAGGTCC TTTACGGCAA AAACTGTTCG GTCACGGCCC GCGCCTTCCA GACAACCATC CGGGTGCCGC ACATCGCGGT GCGTTTTCCC AATCCACTCA GGTTGCTGCT GAATAA
|
Protein sequence | MRKRLIFNCF TMNVPSHIYH GTWRHPNNQL AHFNEFETWS KLATKLEEGR FDAMFFADIL GIDPAYDGKW DAYFEQGLHM PCNDTFTLCA ALAGVTKNLG LVFTSSILSE HPFAFAKKAS TLDHISGGRI GWNIVTSVTD NAARNFGYDK IVPHDQRYDW ADEYMSVLYK LWEGSWEDGA MIADRESGVF SDHTKVHRIN HVGERYKVQG PHLVSPSPQR TPMLYQAGAS KRGSQFAAQH AEGTFVLYPN VDGARIGIAG TKAVAAQVGR GAEDLKFIQG LSFVVGSTME EAERKAAEID EWVSYEGLAA HVSRDMGVDL SNLDPDKPVD ESGLDGLQGY ARMIEMGKPN GEKATVKEVA NALSYNCRIV GTPDSIADEL ALWQDAGVDG INMICQLHPD TYIDFIDHVT PVLQDRGLAQ RDYAEGPLRQ KLFGHGPRLP DNHPGAAHRG AFSQSTQVAA E
|
| |