Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_3006 |
Symbol | |
ID | 7388588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 2512360 |
End bp | 2513913 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643651993 |
Product | hypothetical protein |
Protein accession | YP_002550177 |
Protein GI | 222149220 |
COG category | [S] Function unknown |
COG ID | [COG4383] Mu-like prophage protein gp29 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.357056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCAGA TCCTCGACCA GTATGGCAAC CCGATCGTCA AGGCCGCGAT GAAGCAGGAG CAAGGCGGGC CGACGACGAC TGGCGTTCGC CAAGCGGCTG GCAATCATCA GGCACCCGGT CTGACACCCC AGAAACTTGC CCGCATTCTG CGCGAGGCCA TTGATGGCGA TCCGGAGCGC TACCTTGAGC TTGCGGAGGA CATGGAGGAA CGCAACGAGC ATTATGCTGG TGTTCTGGGT GTGCGCAAAC GACAGGTCGC CGGACTGGAC ATCACGGTTG AGGCAGCCAG CGATAGTGCA GATGATGTCC GCGATGCCGA CCTCGTGCGC GACTTTATAT CCCGCGATAC CTTTGAGGAC GAGCTGTTCG ATATCCTCGA TGCCGTCGGA AAGAGCTTCA GCGCCACCGA AATCATCTGG GACACATCCG AAGGCCAGTG GTCAATCAGG GCGCTCAAAT GGAATGATCC ACGATGGTTC CGATTCGATC GCAATGATGG AGAGACCTTG AGACTGCGTG GTCCGGCCGG TGACGAAGAT CTCTGGCCAG CCAAATGGAT TGTGCATCGC GCCAAGGTCA AATCGGGCCT GACCATTCGT GGCGGCTTGG CCCGATCAGC GGCTTGGACC TATCTATTCA AGACCTTCAC GACTTCGGAT TGGGCCATCT TCTGCGAGGC TTATGGCCAG CCGTTGCGGC TGGGCAAATA TGGCGCTGGC GCAAGCGAGC CCGACAAAGA AAAGCTGCTG CGGGCCGTCT CCAGCATCGC GGCCGATTAT GCCGCCATCG TGCCAGAAAG CATGGCGATT GAGTTTGTCC AGGCGCAATT GTCCGGTAGC CTCGATCTCT ATGAACGGCG GTCCGACTGG CTGGACCGGC AAATATCCAA GCTTGTCCTG GGACAGACGG CTACCACCGA TGCACAGGCT GGCGGCTATG CGGTCGGCAA GGTGCATGAC GGTGTGCGTG AAGATATCGA GCGGGCTGAT GCCCGCCAGT TGGCCGCGAC CCTAAATCGG GATGTTGTCG TGCCACTGGT ATCGTTCAAT CGTGGCCCCC GAAAAAATTA TCCGAAGATC TGCATCGGAC GTCCCGACGA TATCGATGTC AACAATCTGG TTGCTAATGT CGTCAAGCTC GTCCCGCTCG GCTTGAAAGT CGGCATGTCC ACCATGCGCG ACAAGATTGG CCTGCCTGAC CCAGGCAAGG ACGAGGAAAT CCTGAAGCCG GCCGCAGCGG CAGCACAGCC GCCTGCCGAT CAGGCGGACA GCACACCGCC TGCGATTGCC ACCCAAAGCC AGATGGCCCG CGCCGATCGC GATGCTATCG ATGCGGCAGC CGGCCAGATC TCGGCTGATG ATTGGCAGGA GATGACACCG CCCGTGGTCG ATGGTTTGGC CGAGGCATTG AGCAAAGCGA CGTCGATCGC GGAGGCACAG GCGATCCTTG CGGCTCAAGT CAGCGCCATG GGCGTCAATG CCTTTGTCGA GCAACTCGCT CGTGCTGCGT TCTCGGCCCG GATCTCCGGT GAAGCCGATG AGCCGCTCTC ATGA
|
Protein sequence | MAQILDQYGN PIVKAAMKQE QGGPTTTGVR QAAGNHQAPG LTPQKLARIL REAIDGDPER YLELAEDMEE RNEHYAGVLG VRKRQVAGLD ITVEAASDSA DDVRDADLVR DFISRDTFED ELFDILDAVG KSFSATEIIW DTSEGQWSIR ALKWNDPRWF RFDRNDGETL RLRGPAGDED LWPAKWIVHR AKVKSGLTIR GGLARSAAWT YLFKTFTTSD WAIFCEAYGQ PLRLGKYGAG ASEPDKEKLL RAVSSIAADY AAIVPESMAI EFVQAQLSGS LDLYERRSDW LDRQISKLVL GQTATTDAQA GGYAVGKVHD GVREDIERAD ARQLAATLNR DVVVPLVSFN RGPRKNYPKI CIGRPDDIDV NNLVANVVKL VPLGLKVGMS TMRDKIGLPD PGKDEEILKP AAAAAQPPAD QADSTPPAIA TQSQMARADR DAIDAAAGQI SADDWQEMTP PVVDGLAEAL SKATSIAEAQ AILAAQVSAM GVNAFVEQLA RAAFSARISG EADEPLS
|
| |