Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5299 |
Symbol | |
ID | 7380665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 301451 |
End bp | 303121 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643648927 |
Product | ABC transporter substrate binding protein (oligopeptide) |
Protein accession | YP_002547164 |
Protein GI | 222106373 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGATA AGGAACGCAA TTCGATACCG CATCCCACGC GTCGCCAGAC GCTTGGCCTC ATGGCGCTTG GAGCGTCCGG CGTGCTTCTC TCGGGGCTGC CAATCTCGGA GATTCGCGCG GCCAGTGCCG CGACCAAATC GATCAAGGGA CAGCTGACTG TCGGCTTTTC CCAGGAGCCG ACGGTTTTCA ATCCGCATAT GCCTCATATC GAAGTGGATG AAGGCATTCA TTTCAGCATA TTCGATACGC TGTTCAGCGT TGATGCCGCC GGCAAATTCG TTCCGGGGCT GGCCGTGGAC GTCCCGAGCG TCGAAAACGG CGGCATCTCA GCCGATGGCC TGAAGTGGAA GATCAAGCTG CGCGATGGCG TGACATGGCA TGACGGCAAG CCGTTCACCG CTGAAGACGT CAAGGCAACG CTCGAACTGC TGGTCGATGC CAATTTCCGC AGCTGGCGTA AAACCGGCCA TGAATTCGTC CGTGACCTGA CCGTTGTTTC TCCCACGGAA ATCACCTGGC GGATGGACAA GCCCTTCGCG CCCTATCCTT CCATTCTGGC CTCGACCTTC ATCACGCCGA AACATCTTCT GTCGGCCAGC GCCGATCCGA ACAATGCGCC GTTCAACACC GCGCCGGTTG GCACCGGTCC GTTCAAATGG GCTGAGCGGG TCGCCGGAGA CCATATCCTG CTGGCTGCCA ACCCGGATTA TTTTGGTGAC GGCCCCTATG TCGAAAAGCT GATTTACAAA TACGTTCCCG ATCTGAATGT CATGTACACC CAGTTCAAGA CGGGCGATAT TGATGTGGTC GGGTTGCAGT GGATCACGCC CGATCACTAC GAGGAAGCCA AGGGGCTGGA CGGCAAGGTG GTCAATGTGG TGCCGGGTTC GACGGTTGAA TCCTTCACCT TCAACATGGA GCGCCCACAG TTCAAGGAAG CGGCGGTGCG CGAGGCTCTG TATGCCGCCA TCGACAAGCA GTCGATTATC GAAGCGCTCT ATTACGGCCT GCCGACCCCC ACCGAAAGCT ATGTGCCGCA GCAATCCTTC TACTTCAATC CTGACCTGCC GAAGCATGAA TATGACATCG CCAAGGCCAA GAAGCTGCTG GATGATGCCG GCTGGGTGGC TGGTAGCGAC GGCATTCGCG CCAAGGGTGG GGTTAAACTG TCCTTCACCT GCTCAACGAC GGCGGGCAAC CATATCCGCG AACAGGTCCA GCAATTCCTG CAACAGTCCT TCAAGGATAT CGGGGTGGAA ATGACCATTT CCAACCTGCC GCCAGCGGTG ATGTGGGGCG ATTATTGGAC GCTGTCGAAG TTCGACGCCG TCATCGTTGG CCTGGATTTC CTGACGGGGT CCGATCCCGA CACTTCCAAT TTCTTCCGTT CTACGGCGAT CCCGGCCAAG GGCGGTTCCG GCCAGAACAC CTGGCAGTTT TCCAACCAGC AAGTAGACGA GCTGCTCACC AAGGGCGGCG AGCTGTTCGT ACCGGAGGAG CGCAAGGCCG TCTACCTGAA GATCCAGGAG ATCATGCGCA AGGAACTGCC GCTCCTGCCA ATGTTCCAGT ATGCGACCGT GCGCGGCCAC AAGCAGGGCG TCGAGAACGT GACGCCCAAC GTGAATGTGC GTATCGACAC CTGGAACGTC GCCACCTGGC GCTGGGCCTG A
|
Protein sequence | MSDKERNSIP HPTRRQTLGL MALGASGVLL SGLPISEIRA ASAATKSIKG QLTVGFSQEP TVFNPHMPHI EVDEGIHFSI FDTLFSVDAA GKFVPGLAVD VPSVENGGIS ADGLKWKIKL RDGVTWHDGK PFTAEDVKAT LELLVDANFR SWRKTGHEFV RDLTVVSPTE ITWRMDKPFA PYPSILASTF ITPKHLLSAS ADPNNAPFNT APVGTGPFKW AERVAGDHIL LAANPDYFGD GPYVEKLIYK YVPDLNVMYT QFKTGDIDVV GLQWITPDHY EEAKGLDGKV VNVVPGSTVE SFTFNMERPQ FKEAAVREAL YAAIDKQSII EALYYGLPTP TESYVPQQSF YFNPDLPKHE YDIAKAKKLL DDAGWVAGSD GIRAKGGVKL SFTCSTTAGN HIREQVQQFL QQSFKDIGVE MTISNLPPAV MWGDYWTLSK FDAVIVGLDF LTGSDPDTSN FFRSTAIPAK GGSGQNTWQF SNQQVDELLT KGGELFVPEE RKAVYLKIQE IMRKELPLLP MFQYATVRGH KQGVENVTPN VNVRIDTWNV ATWRWA
|
| |