Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5331 |
Symbol | |
ID | 7380692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 333017 |
End bp | 334312 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643648954 |
Product | ABC transporter substrate binding protein (sugar) |
Protein accession | YP_002547191 |
Protein GI | 222106400 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.998389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA ACCGACGCAC CGTGGTGAGC GGTCTGGCTT TGGGCCTCGC TGCTGCCGGG CTTTCCACGC CCGTGCTGGC TGCCGACGAA GTCACGCTCA ACGTGCTTTA CAATCTGCCG GGCTTCACGA AATTCCATCA GCCGCTGGCC GATGCGTTCA TGAAGAACAA TCCGAATGTA AAGATCAATT TTCTGGCGCC CGCTCCAGGC TATAACGAGG GTCAGCAGCA GGTCCTGCGC GCTGCCGTGA CCGGCAATCT GCCGGATGTT TATTTCTCAG GCTTCAACCT GACCGCGGAG CTGGTTCACA CACTGGCACC CCGCAACCAG ATCACCGATC TGGCGCCCTT CATCGCGGCG GAAGGCGGCC AGGCCTTCCT CGACAAAAAT TACAACCCGA AAATGGCGGC CCTCGGCCAG ATCGATGGCA AGCAATACGG CCTCCCCGTC AATGCCTCCT CGCCAATCAT CTATATCAAT GCTGATCTGG TAAAGAAGGC TGGCGGCGAT CCGGACAATA TGCCGAAAAC CTTTCCCGGA CTGATCTCGC TGGCCAAGAA TATCCACGCG CTCGATCCGA AAATCTCCGG CATGGGTTAC GACATCAATG GTTGGCCGGA TGACTGGCTT TGGCAGGCAT TGGTTCTCGA GCAGGGCGGC ACATTGGTCA ACGAAAAGAC CAAGACTGTG GCTTTTGACA ACGAGATTGG CCTCAATGCT CTGAAAATGG TTCGCCAGTT CGTGACCGAG GGTGGTCAGA CCCTGCTCGA CTGGGACCAG TCCCGTCAGC AATTTGGTGC TGGTCTCACT GGTTTCATAT TCTCGACACC GGCCCATGTT CAGACGATCG AGGGACTGGT GGGCGACCGT TTCAAGCTGA AGACGGCAAC CTTCCCGCTG GACAACCCGG AAAAGGGTGG CGTACCGACG GGCGGCAACT CAGCCGTGAT CCTGACGCAG GACAAGGCCA AGCAGGACGC CGCCTGGAAA TATCTGAAAT GGATCACCGG GCCTGAGGCG CAGAACACCA TCGTGCGGAT CACCGGCTAT CTGCCGACCA ACAAGCTTGC CACCGGTGCC GACTATCTTG CGCCTTATTA TGCCGAGCAT CCGAATGTAA AGACCGCCTC GCTCCAGGCA GACCGGTCCT TGCCTTGGGC CGGTTACCCA GGCGGCGATT CCGTTCGCGT CTGGCGCACC CAGCGCGACA TTATCGGCAC GGTCATGCGC GGTGAAGTGA CGCCGGAGGT TGGCCTGAAG CAGATGGTCG ACCAGACCAA CGCCTTGTTG AAATAG
|
Protein sequence | MKINRRTVVS GLALGLAAAG LSTPVLAADE VTLNVLYNLP GFTKFHQPLA DAFMKNNPNV KINFLAPAPG YNEGQQQVLR AAVTGNLPDV YFSGFNLTAE LVHTLAPRNQ ITDLAPFIAA EGGQAFLDKN YNPKMAALGQ IDGKQYGLPV NASSPIIYIN ADLVKKAGGD PDNMPKTFPG LISLAKNIHA LDPKISGMGY DINGWPDDWL WQALVLEQGG TLVNEKTKTV AFDNEIGLNA LKMVRQFVTE GGQTLLDWDQ SRQQFGAGLT GFIFSTPAHV QTIEGLVGDR FKLKTATFPL DNPEKGGVPT GGNSAVILTQ DKAKQDAAWK YLKWITGPEA QNTIVRITGY LPTNKLATGA DYLAPYYAEH PNVKTASLQA DRSLPWAGYP GGDSVRVWRT QRDIIGTVMR GEVTPEVGLK QMVDQTNALL K
|
| |