Gene Avi_8020 details

Gene Information

Locus tag: Avi_8020
Symbol:
ID: 7365150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011982
Strand:
Start bp: 21013
End bp: 22674
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 58%
IMG OID: 643641702
Product: trehalose synthase
Protein accession: YP_002539999
Protein GI: 222080136
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
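
The coordinates and lengths in this record agree with each other, which a short sketch can confirm. It assumes that the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the record states neither. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(21013, 22674)
print(length)                  # 1662
print(protein_length(length))  # 553
```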


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATCAACG ACCTCTGGTA CAAGAACGCT GTCGTCTACT GCCTGTCCGT CGAGACCTTC 
ATGGATGCCA ACGGCGATGG AGTGGGCGAT TTTCAGGGGC TGCAACGGCG GTTGGATTAT
CTGGCCGGTC TGGGCGTCAA TGCCATCTGG CTGATGCCGT TTCAGGCCTC TCCGGGTCTC
GATGATGGCT ACGATGTCTC CGACTATTAC AACATCGATC CCAGATACGG CACCCTCGGC
GACTTCGTCG AATTCACCCA TAGCGCAAAG CAACGCGGTA TACGGGTGCT GATCGACCTC
GTCGTCAATC ATACGTCCGA TCAGCATCCC TGGTTCAAAC AGGCCTGCGC CGACAAGAAT
TCCCGATACA GGGACTGGTA CGTATGGTCT GAGAAGAAGC CGGAAAATGC CGACGAGGGC
ATGGTCTTTC CTGGTGTACA AAAGACCACA TGGACACGGG ACGAAAAGTC CGGCGAATAC
TATTTTCACC GTTTCTTCAA ATTCCAGCCG GATCTCAACA CCAGCAATCC ACATGTCCAG
GCGGAAATCC TCAAGATCAT GGGTTTCTGG ATACAGCTCG GCGTATCGGG TTTTCGCATG
GACGCCGTTC CCTTCGTGAT CGCCGAAAAG GGTGCAGATG TGAAGGAATC GAAGCCGCAG
TTCGATCTGT TGCGCAGTTT CCGCGAGTTC CTGCAGTGGC GCAAGGGTGA CAGCATCATC
CTTGCGGAGG CCAATGTCGT GCCCAAGGAG AACCTGCAGT ATTTCGGCGA CGACGGCGAT
CGTATGCAGA TGATGTTCAA TTTCCACGTC AACCAGGCTC TGTTCTACGC GCTCGCTAGC
GCCGATACCC GGCCCCTTGA CAAGGCGATG AACGAAACCC GGGAGCGACC GCAAACGGGG
CAATGGGGAA TATTCCTGCG CAACCACGAC GAACTCGATC TTGGCCGGCT GACGGAAAAA
CAGCGGCAAG CCGTGTTTTC CGCCTTCGGT CCAGACAAGG ACATGCAGCT CTATGATCGG
GGGATACGGC GTCGCCTAGC GCCGATGCTG GGCGGCGATA CCAGGCGTCT CAGGCTTGCT
TATAGCCTGA TGTATTCGCT GCCGGGCACC CCGGTCCTGC GATATGGTGA CGAGATCGGC
ATGGGAGACG ACCTAGCATT GGAGGAGCGC AACTGCGCCC GCACGCCTAT GCAATGGTCG
ACCGAGCCGC ATGGCGGCTT CACCAAGGCG GAAAAACCCG TCTTGCCCGT CATCGAAGGA
GGTCCCTATG GGTTCGAACA TGTCAATGTC GCGGCACAGC GGCGGGACGC GGAATCGATG
CTGAACTGGA CCGAGCGGAT GATCCGTATG CGAAAGGAAG CCCCTGAAAT CGGATGGGGA
AGTTTCAGCG TTCTGGACTG CGGCGATACC GGTGTCCTGG CGATGCGCTA CGACTGGCGC
CACAACGCCG TCGTCATCAT CCACAATCTC CACGACAAGC CAGTCGATAT CTCGTTCGAT
CCCGGCGTAG GTGAGAGTGG ACGCGTCCTG ATCGACATCG CCGACGGCAG CGACAGCAGC
GCGGACGAGA AAGGCCGGCA TAACATGGTG ATCGAGCCAT TCGGTTATCG CTGGTACCGC
GCCGGCGGGC TCGATTACCT GCTCAAGAGA AGCGACATCT GA
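
The gene sequence is printed in 10-base blocks, so whitespace has to be removed before any computation. The sketch below does that cleanup and computes a GC percentage. The helper names are hypothetical, and the record does not say how its GC content figure was computed or rounded, so the result may not match it exactly.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Final line of the gene sequence as printed in this record.
last_line = clean_sequence("GCCGGCGGGC TCGATTACCT GCTCAAGAGA AGCGACATCT GA")
print(len(last_line))  # 42
```

To apply this to the whole gene, pass all sequence lines joined together to `clean_sequence` and then to `gc_percent`.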
 
Protein sequence
MINDLWYKNA VVYCLSVETF MDANGDGVGD FQGLQRRLDY LAGLGVNAIW LMPFQASPGL 
DDGYDVSDYY NIDPRYGTLG DFVEFTHSAK QRGIRVLIDL VVNHTSDQHP WFKQACADKN
SRYRDWYVWS EKKPENADEG MVFPGVQKTT WTRDEKSGEY YFHRFFKFQP DLNTSNPHVQ
AEILKIMGFW IQLGVSGFRM DAVPFVIAEK GADVKESKPQ FDLLRSFREF LQWRKGDSII
LAEANVVPKE NLQYFGDDGD RMQMMFNFHV NQALFYALAS ADTRPLDKAM NETRERPQTG
QWGIFLRNHD ELDLGRLTEK QRQAVFSAFG PDKDMQLYDR GIRRRLAPML GGDTRRLRLA
YSLMYSLPGT PVLRYGDEIG MGDDLALEER NCARTPMQWS TEPHGGFTKA EKPVLPVIEG
GPYGFEHVNV AAQRRDAESM LNWTERMIRM RKEAPEIGWG SFSVLDCGDT GVLAMRYDWR
HNAVVIIHNL HDKPVDISFD PGVGESGRVL IDIADGSDSS ADEKGRHNMV IEPFGYRWYR
AGGLDYLLKR SDI
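
The protein sequence begins with M although the gene sequence begins with TTG. This is consistent with translation table 11, where TTG can act as an initiation codon. The sketch below shows such a translation under stated assumptions: the codon table is the standard layout in TCAG order, which table 11 shares for elongation, and the start-codon set is the one assumed here for NCBI table 11. The function name is illustrative, and this is not the tool that produced the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a recognised first codon as Met."""
    seq = "".join(seq.split()).upper()  # drop the 10-base block spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGATCAACG ACCTCTGGTA CAAGAACGCT"))  # MINDLWYKNA
```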