Gene Avi_5950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5950 
Symbol 
ID7381036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp964941 
End bp966407 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content57% 
IMG OID643649464 
Productcrocetin dialdehyde 
Protein accessionYP_002547695 
Protein GI222106904 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.444357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGTCC CATTTCCAAA TAAGCCTGAA TTTACTGGAT CGCTCTATAA GCCGGCCCGT 
TTTGAAGGCC AGGTTTATGA TCTCGAAGTG GAGGGCAAGG TGCCCGAGGA GATCGACGGC
ACGTTCTTTC AGGTTGCTCC CGACCCGCAA TATCCCCCAA TGCTTGGAGA GGACATTTTC
TTCAATGGCG ATGGCGCTGT CAGCGCCTTC CGGTTCAAGA ACGGCCATGT GGATTTTCAG
CGTCGTTACG TGATGACCGA GCGATTGAAG GCTCAGCGCG ACGCCCGCGC CTCCCTGCAT
GGGATCTATC GCAATCCCTT TACCAATGAT CCGAGCGTCA AGGATATTTC CAACTCCACC
GCCAATACCA ATGTCGTTGT TCACAATGGC AAGCTGCTGG CGCTGAAGGA GGATAGCCCT
CCTTATGCCC TCGACCCGAT CACGCTTGAA ACCATCGGTC TTTATGATTT CGATGGTCAG
TTGACCAGCG CGACCTTCAC GGCCCATCCG AAATTTGACC CTGAAACCGG CGATCTGCTG
TGTTTCGGCT ACGAGGCCAA AGGCGAGGCC ACACCCGACA TCGTCTATTA CGAGATCGAC
AAGCATGGCC GGATGAAGCG TGAAGTGTGG ATCACTGCAC CTTATGCCGC AATGATCCAT
GATTTTGCGG TCACGGAGCA TTTCGTGATT TTCCCGCTGA TGCCGTTAAC GGCGGATCTG
GAGCGGATGA AGCAGGGCGG TAAGCACTTC CAGTGGCAGC CGGGTCTGGA TCAGTTGTTC
GGCATTCTGC GCCGGGATGG CGACGGACGT GATGTTCGTT GGTTCAAGGC GCCCAACGGC
TTCCAGGGCC ACACCCTGAA CGCCTTCGAT GACGGCGGCA GGATCTTTGT CGATATGCCC
GTTACATCAG GCAATATCTT CTATTTCTTC CCGCAATCGG ATGGCACGGT GCCGCCGCCG
GAAACCCTGT CGTCGCAGAT GATGCGCTGG ACCTTCGACA TGCGGTCGAA CGGTAACAAT
ATCGAAGTGA AGCCGCTCAC CAGTTTCGCG TGTGAGTTCC CCCGCAGCGA CGACCGCTAT
TGCGGCCGAC AATATCGCCA TGGCTTTGTG ATCGCCATGG ATCCAACCAA GCCCTTTGAC
GAAGCCCGGA TCGGCCCGCG TCCGTTTCAG TTCTTCAACC AGTTGGCGCA TCTGGATATT
GCAACCGGAA AGACCCAGCT CTGGTTTGCC GATGACCAGT CCTGTTTCCA GGAGCCGATC
TTCGTGCCAC GCCGGCCGGA TGCGCCGGAA GGCGATGGCT ATGTGATCGG TCTGGTTAAT
CGGTTGGCGG AGCGCGCCAC GGATCTCCTC GTGCTGGATG CGCAACATCT GTCCGACGGT
CCGATTGCGA CCATCAAGCT GCCGATGCGT CTACGCATGT CGTTGCACGG CAATTGGGTG
CCGGGCGATC AGCTGAAGGC GGTTTGA
 
Protein sequence
MTVPFPNKPE FTGSLYKPAR FEGQVYDLEV EGKVPEEIDG TFFQVAPDPQ YPPMLGEDIF 
FNGDGAVSAF RFKNGHVDFQ RRYVMTERLK AQRDARASLH GIYRNPFTND PSVKDISNST
ANTNVVVHNG KLLALKEDSP PYALDPITLE TIGLYDFDGQ LTSATFTAHP KFDPETGDLL
CFGYEAKGEA TPDIVYYEID KHGRMKREVW ITAPYAAMIH DFAVTEHFVI FPLMPLTADL
ERMKQGGKHF QWQPGLDQLF GILRRDGDGR DVRWFKAPNG FQGHTLNAFD DGGRIFVDMP
VTSGNIFYFF PQSDGTVPPP ETLSSQMMRW TFDMRSNGNN IEVKPLTSFA CEFPRSDDRY
CGRQYRHGFV IAMDPTKPFD EARIGPRPFQ FFNQLAHLDI ATGKTQLWFA DDQSCFQEPI
FVPRRPDAPE GDGYVIGLVN RLAERATDLL VLDAQHLSDG PIATIKLPMR LRMSLHGNWV
PGDQLKAV