Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_7212 |
Symbol | |
ID | 7380370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011981 |
Strand | - |
Start bp | 181186 |
End bp | 182781 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643641317 |
Product | ABC transporter substrate binding protein (oligopeptide) |
Protein accession | YP_002539614 |
Protein GI | 222102575 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTCA CCAGACGCGA CTTTCACCGC ATTGCCCTTG CCGCAGGGGC CTTCACGGCG CTGCCCGGCG GGGCCTTTGC CGCCGCAGAA GAGCCAATTT CCGGCGGAAC GCTGAAGATT GTTTATTTTC CCGAGCCAAC CCAGTTGGTG GCCATCAACA CCAGCTCGGG CGGTGCGCAA TTCATCGGAG CCAAAATTTT CGATGGTTTG CTGACCTATG ATTACGACCT GACGCCCAAA CCAGCGCTGG CCAAGGAATG GAGCATTTCA CCTGACGGTA AGACCTATAT CTTCCATCTG CGCCCCAATG TAACCTTCCA TGATGGCAAG CCGTTGACAT CGGCGGATGT CGCCTTTTCG ATTTTCCGCC TCAAGGAAGC CCATCCGCGG GGACGGGCTA TTTTTGCCAC CGTGGAAAGC ATTGATACCA GCGATCCGCT GGTGGCAAAG CTCATCCTTT CCAAGCCGAC GCCAGCGCTG ATCACCGCCC TGTCTGGCTC TGAATCACCC ATTGTGCCAA AGCACATTTT CGAGACCTTC AAGCCCACCG AAAATCCAAA GCCGCAGCAG ATTATTGGCA GCGGCCCCTT TGCGCTGAAG GAATGGGTGC CGGGCAGCCA TATTCTGCTG GAACGCAATG CTGCCCATTG GGATGCGCCA AGGCCCTATC TCGACCGCGT TATTCTGAGG CTGATCAATG ATGCCAGCGC TCGCTCGGCA GCCCTTGAAA CCGGGGAGGC CGATATTGGT CCCAATCCGG TTCCGCTTTC TGATCTGGAT CGGTTGAAAA GCGTTGCGAC ACTCAAGGTC GATGATCAGA TCTATGCCTA TGCGGGCCAG CAGAACCAGT TGGTGATCAA TCTGGAAAAC AGCTACCTGA AGGAGCAGAA AGTCCGCCAA GCCATTGCCC ATGCCATCGA TGTGAAGGCG CTGATCAACA TCGTGCTCTA TGGCTATGGC ATCCCCTCAC CCACGCCGAT CAGCCCCGGT CTGGCGAAAT TTCACAATCC CGATATTGGC TTTGCCAAAT ATGATGTGGC CTTAGCCGAA AAACTGCTGG ATGAAGCGGG CTTTCGCCGT CAAGCAAACG GCAAGCGTTT CAAGCTAAGG GTCACGACCA ACCCGTTTAA TCCGGCGAGC TATTCAGACT TTATCGCACA AGCGCTGATC AAGATCGGCA TTGAGGCGGA TATTCAAAAA TTCGATTTCG GCACCTATGT CAAAGTGGTC TATACGGACC GCGCCTGGGA TCTTTCGGTG GAATCGCTCT CCAACACATT TGATCCCACC GCTGGCGTGC AGCGGGTCTA TTGGAGCAAG AATTTCAAAA TCGGCCTGCC CTTTTCCAAC GCCAGCCACT ACGAAAACCC GGAGGTGGAC CGCCTGCTGG AAGCGGCTGC GGTGGAGCCG GACATTGAAA AACGCGCCCA ATTGTTTAGG GATTTTCAGG TGATCATCGC CCGCGATCTT CCCGTGATCA ATCTCGTTTC CCCCATCCAA CCCGTGGTGG GCAACAAGCG CGTGCGCAAT TACGCTTATG GCGCGGAAGG TCTGATTGGC AACCTTGCCT ATGCCAGCCT TGCAAAAAAC AGCTAA
|
Protein sequence | MTLTRRDFHR IALAAGAFTA LPGGAFAAAE EPISGGTLKI VYFPEPTQLV AINTSSGGAQ FIGAKIFDGL LTYDYDLTPK PALAKEWSIS PDGKTYIFHL RPNVTFHDGK PLTSADVAFS IFRLKEAHPR GRAIFATVES IDTSDPLVAK LILSKPTPAL ITALSGSESP IVPKHIFETF KPTENPKPQQ IIGSGPFALK EWVPGSHILL ERNAAHWDAP RPYLDRVILR LINDASARSA ALETGEADIG PNPVPLSDLD RLKSVATLKV DDQIYAYAGQ QNQLVINLEN SYLKEQKVRQ AIAHAIDVKA LINIVLYGYG IPSPTPISPG LAKFHNPDIG FAKYDVALAE KLLDEAGFRR QANGKRFKLR VTTNPFNPAS YSDFIAQALI KIGIEADIQK FDFGTYVKVV YTDRAWDLSV ESLSNTFDPT AGVQRVYWSK NFKIGLPFSN ASHYENPEVD RLLEAAAVEP DIEKRAQLFR DFQVIIARDL PVINLVSPIQ PVVGNKRVRN YAYGAEGLIG NLAYASLAKN S
|
| |