Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5950 |
Symbol | |
ID | 7381036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 964941 |
End bp | 966407 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643649464 |
Product | crocetin dialdehyde |
Protein accession | YP_002547695 |
Protein GI | 222106904 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.444357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGTCC CATTTCCAAA TAAGCCTGAA TTTACTGGAT CGCTCTATAA GCCGGCCCGT TTTGAAGGCC AGGTTTATGA TCTCGAAGTG GAGGGCAAGG TGCCCGAGGA GATCGACGGC ACGTTCTTTC AGGTTGCTCC CGACCCGCAA TATCCCCCAA TGCTTGGAGA GGACATTTTC TTCAATGGCG ATGGCGCTGT CAGCGCCTTC CGGTTCAAGA ACGGCCATGT GGATTTTCAG CGTCGTTACG TGATGACCGA GCGATTGAAG GCTCAGCGCG ACGCCCGCGC CTCCCTGCAT GGGATCTATC GCAATCCCTT TACCAATGAT CCGAGCGTCA AGGATATTTC CAACTCCACC GCCAATACCA ATGTCGTTGT TCACAATGGC AAGCTGCTGG CGCTGAAGGA GGATAGCCCT CCTTATGCCC TCGACCCGAT CACGCTTGAA ACCATCGGTC TTTATGATTT CGATGGTCAG TTGACCAGCG CGACCTTCAC GGCCCATCCG AAATTTGACC CTGAAACCGG CGATCTGCTG TGTTTCGGCT ACGAGGCCAA AGGCGAGGCC ACACCCGACA TCGTCTATTA CGAGATCGAC AAGCATGGCC GGATGAAGCG TGAAGTGTGG ATCACTGCAC CTTATGCCGC AATGATCCAT GATTTTGCGG TCACGGAGCA TTTCGTGATT TTCCCGCTGA TGCCGTTAAC GGCGGATCTG GAGCGGATGA AGCAGGGCGG TAAGCACTTC CAGTGGCAGC CGGGTCTGGA TCAGTTGTTC GGCATTCTGC GCCGGGATGG CGACGGACGT GATGTTCGTT GGTTCAAGGC GCCCAACGGC TTCCAGGGCC ACACCCTGAA CGCCTTCGAT GACGGCGGCA GGATCTTTGT CGATATGCCC GTTACATCAG GCAATATCTT CTATTTCTTC CCGCAATCGG ATGGCACGGT GCCGCCGCCG GAAACCCTGT CGTCGCAGAT GATGCGCTGG ACCTTCGACA TGCGGTCGAA CGGTAACAAT ATCGAAGTGA AGCCGCTCAC CAGTTTCGCG TGTGAGTTCC CCCGCAGCGA CGACCGCTAT TGCGGCCGAC AATATCGCCA TGGCTTTGTG ATCGCCATGG ATCCAACCAA GCCCTTTGAC GAAGCCCGGA TCGGCCCGCG TCCGTTTCAG TTCTTCAACC AGTTGGCGCA TCTGGATATT GCAACCGGAA AGACCCAGCT CTGGTTTGCC GATGACCAGT CCTGTTTCCA GGAGCCGATC TTCGTGCCAC GCCGGCCGGA TGCGCCGGAA GGCGATGGCT ATGTGATCGG TCTGGTTAAT CGGTTGGCGG AGCGCGCCAC GGATCTCCTC GTGCTGGATG CGCAACATCT GTCCGACGGT CCGATTGCGA CCATCAAGCT GCCGATGCGT CTACGCATGT CGTTGCACGG CAATTGGGTG CCGGGCGATC AGCTGAAGGC GGTTTGA
|
Protein sequence | MTVPFPNKPE FTGSLYKPAR FEGQVYDLEV EGKVPEEIDG TFFQVAPDPQ YPPMLGEDIF FNGDGAVSAF RFKNGHVDFQ RRYVMTERLK AQRDARASLH GIYRNPFTND PSVKDISNST ANTNVVVHNG KLLALKEDSP PYALDPITLE TIGLYDFDGQ LTSATFTAHP KFDPETGDLL CFGYEAKGEA TPDIVYYEID KHGRMKREVW ITAPYAAMIH DFAVTEHFVI FPLMPLTADL ERMKQGGKHF QWQPGLDQLF GILRRDGDGR DVRWFKAPNG FQGHTLNAFD DGGRIFVDMP VTSGNIFYFF PQSDGTVPPP ETLSSQMMRW TFDMRSNGNN IEVKPLTSFA CEFPRSDDRY CGRQYRHGFV IAMDPTKPFD EARIGPRPFQ FFNQLAHLDI ATGKTQLWFA DDQSCFQEPI FVPRRPDAPE GDGYVIGLVN RLAERATDLL VLDAQHLSDG PIATIKLPMR LRMSLHGNWV PGDQLKAV
|
| |