Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5570 |
Symbol | thuB |
ID | 7381478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 582202 |
End bp | 583302 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643649150 |
Product | trehalose utilization-related protein |
Protein accession | YP_002547387 |
Protein GI | 222106596 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.541955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACAC CCGTCCGTAT TCTCATTCTC GGTACCGGTG GCATGGCGAA CCGCCATGCC ATCGAGTTTG CCACCAATAG CGACGCCCAA CTGGTCGGCG CCGTCGATGT CGATATCGCC AAGGTACGTG GTTTTGCGCT GCGCCATGAT ATCGCCAATA CATTCACCTC GCTGGAGGAC GCGCTTGCCT GGGGTGAGTT CGATGCCGTC GCCAATGTGA CGCCCGACCG GGTGCATTAT CCGACGACAC TGCAATTGAT CGCGGCGGGC AAGCATGTGT TTTGCGAAAA GCCGCTGGCT GAAAGCTTTG AAAAAGCTGA TGAGATGGCC CGCAAAGCCA ATGCCGCTGG ACTGGTGAAC ATGGTCAACC TGACCTATCG CAATGTCGCG CCGGTACAGA AAGCACGACA GCTGGTTCAG GCTGGTGAGA TCGGCAAGGT CCGTCATATC GAGGCCTCCT ATCTGCAAAG CTGGCTGGTG TCCAAGGCCT GGGGCGACTG GGCAACCGAG GACCAATGGC TATGGCGGCT ATCCACCAAG CACGGCTCAA ACGGGGTGCT CGGCGATGTC GGCATCCATA TTCTCGATTT CGCCAGCTAT GGCGCGGCAA CCGATGTCGA GCATATCTTT GCGCGGTTGA AGACATTTGA GAAAGCGCCG GGCAACCGGA TTGGCGACTA TGATCTTGAC GCCAATGACA GTTTCACCAT GACGGCGGAA TTCAGCAATG GCGCCATCGG CGTCGTGCAC GCCTCGCGCT GGGCAACCGG CCACCTTAAC GAATTGCGCC TGCGCATGCA TGGCGACAAG GGCGCATTGG AGGTGATCCA CACGCCAACG GGCAATACAT TGCGCGCCTG CATGGGCGAC GATGTGGAAA AAGGCATTTG GACGGAAGTG GATGCCGGCA CCGTTGCCAC CAATTACCAG CGCTTCGTCG AGGCGGTGCT GGAGGGCAAA ACCCGCGAAC CTGACTTCCG CCATGCGGCT GACCTGCAAA AAGTGCTGGA TCTGGCGATG GTGACAGAGT TGGAGCGGCG CGAACATTCC GTTCGCCGTG TTGATGCGGC ACCTTCGCTG GCTGCGGTAT CATCAAGGTG A
|
Protein sequence | MNTPVRILIL GTGGMANRHA IEFATNSDAQ LVGAVDVDIA KVRGFALRHD IANTFTSLED ALAWGEFDAV ANVTPDRVHY PTTLQLIAAG KHVFCEKPLA ESFEKADEMA RKANAAGLVN MVNLTYRNVA PVQKARQLVQ AGEIGKVRHI EASYLQSWLV SKAWGDWATE DQWLWRLSTK HGSNGVLGDV GIHILDFASY GAATDVEHIF ARLKTFEKAP GNRIGDYDLD ANDSFTMTAE FSNGAIGVVH ASRWATGHLN ELRLRMHGDK GALEVIHTPT GNTLRACMGD DVEKGIWTEV DAGTVATNYQ RFVEAVLEGK TREPDFRHAA DLQKVLDLAM VTELERREHS VRRVDAAPSL AAVSSR
|
| |