Gene Avi_5570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5570 
SymbolthuB 
ID7381478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp582202 
End bp583302 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content59% 
IMG OID643649150 
Producttrehalose utilization-related protein 
Protein accessionYP_002547387 
Protein GI222106596 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.541955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACAC CCGTCCGTAT TCTCATTCTC GGTACCGGTG GCATGGCGAA CCGCCATGCC 
ATCGAGTTTG CCACCAATAG CGACGCCCAA CTGGTCGGCG CCGTCGATGT CGATATCGCC
AAGGTACGTG GTTTTGCGCT GCGCCATGAT ATCGCCAATA CATTCACCTC GCTGGAGGAC
GCGCTTGCCT GGGGTGAGTT CGATGCCGTC GCCAATGTGA CGCCCGACCG GGTGCATTAT
CCGACGACAC TGCAATTGAT CGCGGCGGGC AAGCATGTGT TTTGCGAAAA GCCGCTGGCT
GAAAGCTTTG AAAAAGCTGA TGAGATGGCC CGCAAAGCCA ATGCCGCTGG ACTGGTGAAC
ATGGTCAACC TGACCTATCG CAATGTCGCG CCGGTACAGA AAGCACGACA GCTGGTTCAG
GCTGGTGAGA TCGGCAAGGT CCGTCATATC GAGGCCTCCT ATCTGCAAAG CTGGCTGGTG
TCCAAGGCCT GGGGCGACTG GGCAACCGAG GACCAATGGC TATGGCGGCT ATCCACCAAG
CACGGCTCAA ACGGGGTGCT CGGCGATGTC GGCATCCATA TTCTCGATTT CGCCAGCTAT
GGCGCGGCAA CCGATGTCGA GCATATCTTT GCGCGGTTGA AGACATTTGA GAAAGCGCCG
GGCAACCGGA TTGGCGACTA TGATCTTGAC GCCAATGACA GTTTCACCAT GACGGCGGAA
TTCAGCAATG GCGCCATCGG CGTCGTGCAC GCCTCGCGCT GGGCAACCGG CCACCTTAAC
GAATTGCGCC TGCGCATGCA TGGCGACAAG GGCGCATTGG AGGTGATCCA CACGCCAACG
GGCAATACAT TGCGCGCCTG CATGGGCGAC GATGTGGAAA AAGGCATTTG GACGGAAGTG
GATGCCGGCA CCGTTGCCAC CAATTACCAG CGCTTCGTCG AGGCGGTGCT GGAGGGCAAA
ACCCGCGAAC CTGACTTCCG CCATGCGGCT GACCTGCAAA AAGTGCTGGA TCTGGCGATG
GTGACAGAGT TGGAGCGGCG CGAACATTCC GTTCGCCGTG TTGATGCGGC ACCTTCGCTG
GCTGCGGTAT CATCAAGGTG A
 
Protein sequence
MNTPVRILIL GTGGMANRHA IEFATNSDAQ LVGAVDVDIA KVRGFALRHD IANTFTSLED 
ALAWGEFDAV ANVTPDRVHY PTTLQLIAAG KHVFCEKPLA ESFEKADEMA RKANAAGLVN
MVNLTYRNVA PVQKARQLVQ AGEIGKVRHI EASYLQSWLV SKAWGDWATE DQWLWRLSTK
HGSNGVLGDV GIHILDFASY GAATDVEHIF ARLKTFEKAP GNRIGDYDLD ANDSFTMTAE
FSNGAIGVVH ASRWATGHLN ELRLRMHGDK GALEVIHTPT GNTLRACMGD DVEKGIWTEV
DAGTVATNYQ RFVEAVLEGK TREPDFRHAA DLQKVLDLAM VTELERREHS VRRVDAAPSL
AAVSSR