Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5515 |
Symbol | |
ID | 7381431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 511602 |
End bp | 513503 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643649103 |
Product | ABC transporter substrate binding protein (oligopeptide) |
Protein accession | YP_002547340 |
Protein GI | 222106549 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0171495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAA GCGTTTTGAA ATGGGTTGGC GGTGGGTTTG GCGTCATGCT GTCGCTGAGC AATGCCTATG CGTTCGGCGA AGCGCCGATG CTGAAGAGCC AGGTCGATGC GGGCAAGCTC CCGCCGGTCG AACAGCGCCT TCCCAAGAAG CCTGCCATCA TCCCTGTTTA TAACGAGATC GGCACTTATG GTGGCACGCT CCGGCGTGCC TATAGCGGCA TCGGTGATCG CATGGGGTCC ACCAAGCTGA TCGAAGAGCG GGCCTTGAAG TTGCAGCAAT CGTCCGACGG CAAGGTCGAT CTGGTGCAGC GTTTCGTTGA AAGCTGGTCG GCCAATGCCG ATTCAACCGA ATTTACCTTC ACGCTGCTCG ATGGCATGAA ATGGTCCGAT GGCGTACCGG TCACCACGGA AGATGTGAAA TTCTGGTATG AGGATCTGTT CCTCAATACC GATCTCAATC CCAGCCCTCC CGGCTTCCTG TTTTCCGGCG GCAAGCCGAT GAAACTCGAC ATTGTCGATG CGCATACGTT CAAGGTCAGC TTCGCCCAGC CTTACGCGCT GTTTCCCTAT GTTCTGGCGG TTCAATCGAC GGGATGGCCG GGCCTCGACA AGCCGAGCTT TATCCAGCCA GCCCATTATC TGAAGAAATT CCTGCCGAAA TACAGCACAC CGGCAGAGCT GGATGCCATC GTCAAGGCCA AGGGTGTGCC CAATTGGCGG GCGCTCTGGG ATTTGAAAGG CGTGATCCAG GCCTGGTGGT TGAACCCGGA TCTGCCTGTC GTGACCGCCT GGAAGGTCGT GACGCCGCCA CCGGCCAGCA CCATCGTTTT TGAGCGTAAT CCCTATTATT GGGCAACCGA CAAGGCTGGC AACCAGCTTC CCTATATCGA CCGGATCGAG GCAAAGCTTT TCCAGGACCA TCAGGCGGTC AACCTGATGA TCGTTCAGGG CCAGATCGAC TTCCAGTCGC GGTTTGTTGA AGCGCGGGAT TATCCGCTGC TCAAGGAAAA CGAACAGGCG GGCAATTACA CCGTTCATCC TTGGAAGAGC GGTGAAAACC TGGCGATTAT CCCCAATATC AACGACACCG ACGAGGTGAA ACGCAAGCTG TTTGACGATA TTCGCTTTCG CGAAGCGTTG AGCATCTCGA TTGACCGCGA AGCCATCAAC GAGACAGTGT TTTCCGGTCT GGCCATGCCA AGAGCTGCCG CGCCAGCCAA GGGATCGCCC TATTACGACC CCGAATTCGA GACCAAGTGG ACCGGTCTCG ATATTGATCG CGCCAATACG CTGCTGGACG AAATCGGCCT GAAAAAGGGT AGCGACGGGT TCCGCACCGG GCCGGATGGC AAGCGCCTCA GCCTGGTGAT CGAGACGATT GACGAGAATA TTCCGCCTGA GATGGTTGAG GTTATCCGTC AGGGATGGCA GCAGATCGGC ATTGAAGGGC TGATCCGTTC GGTTGATGAA ACCGCCAGCC TTCAGCATAT CAAATCCGGC AATTTCGATA TCATCACCGC CTATGCTGAC CGGTTGCTGA TGCCGCAGGC CGATCCGACC CTGCTGCTTG GCCGCGAATC CTACGCCAAC GCTTATTTCG AATGGTATAA TTCCGCTGGC AAGAGCGGCA CGGAGCCGCC GAAGGATCAC CCGATCCGCA AGCTGTTTGA TGCCTGGACC GCCGCGTCAA GCTCCAAGAC TGTTGATGAA GCCAACCAGC ATATGAAGGA CATGATCAAA GTCATGAAGG ACAATGTCTG GATGATTGGT CTTGTCGGCG AATCTGTCAC GCCCTTCGTC GTCAACAACA AGATCGGCAA TTTCCCCGAT GTGATGACCA ACGAAGAAGC CCTGCGCAAT GAAGGCAATG CCATTCCCGC GCAGCTGTTC TTCAAGAAGT GA
|
Protein sequence | MQKSVLKWVG GGFGVMLSLS NAYAFGEAPM LKSQVDAGKL PPVEQRLPKK PAIIPVYNEI GTYGGTLRRA YSGIGDRMGS TKLIEERALK LQQSSDGKVD LVQRFVESWS ANADSTEFTF TLLDGMKWSD GVPVTTEDVK FWYEDLFLNT DLNPSPPGFL FSGGKPMKLD IVDAHTFKVS FAQPYALFPY VLAVQSTGWP GLDKPSFIQP AHYLKKFLPK YSTPAELDAI VKAKGVPNWR ALWDLKGVIQ AWWLNPDLPV VTAWKVVTPP PASTIVFERN PYYWATDKAG NQLPYIDRIE AKLFQDHQAV NLMIVQGQID FQSRFVEARD YPLLKENEQA GNYTVHPWKS GENLAIIPNI NDTDEVKRKL FDDIRFREAL SISIDREAIN ETVFSGLAMP RAAAPAKGSP YYDPEFETKW TGLDIDRANT LLDEIGLKKG SDGFRTGPDG KRLSLVIETI DENIPPEMVE VIRQGWQQIG IEGLIRSVDE TASLQHIKSG NFDIITAYAD RLLMPQADPT LLLGRESYAN AYFEWYNSAG KSGTEPPKDH PIRKLFDAWT AASSSKTVDE ANQHMKDMIK VMKDNVWMIG LVGESVTPFV VNNKIGNFPD VMTNEEALRN EGNAIPAQLF FKK
|
| |