Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_3047 |
Symbol | |
ID | 7388613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 2536851 |
End bp | 2538404 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643652018 |
Product | hypothetical protein |
Protein accession | YP_002550202 |
Protein GI | 222149245 |
COG category | [S] Function unknown |
COG ID | [COG4383] Mu-like prophage protein gp29 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.88762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCAGA TCCTCGACCA ATACGGCAAT CCGATCAGCA GCGGCCAGAT CAGACAAGAG CAGGCGGCAC CTACGGTGAC CGGTGTTCGC CGTCCTGTCG GCAATCATCA GGCACCGGGG CTGACACCGC CGAAGCTGGC GCGGATCCTC AGGGAGTCGA TCGACGGCGA CCCAGAGCGC TATCTCGAGC TTGCCGAAGA CATGGAGGAA CGCAACGAGC ATTATGCCGG CGTGCTTGGC GTTCGAAAGC GCCAGGTCGC GGGCCTGGAG ATTACGGTCG AGGCCGCAAG CGATAGCGCC GATGATGTCG CAGCGGCCGA TCTGGTGCGT GACGTCATCG GCCGCGACGA TCTTGAAGAC GAGCTGTTCG ATATACTCGA TGCGGTCGGT AAAGGCTTTT CCGCCACCGA AATCATTTGG GATACATCCG AGGGCCAGTG GACGATCGAG GCACTGAAGT GGCGCGATCC GCGCTGGTTC GTGTTTGATC GCGACGATGG TGAGACACTT CGGCTGCGTG GTGCCGCAGG CGATGAGGAT CTCTGGCCGG CGAAGTGGAT CGTGCACAAA GCCAAGATCA AGTCCGGCCT TCCGATCCGA GGCGGCTTGG CGCGCTCGGC GGCCTGGGCG TATCTGTTCA AGACCTTCAC GGCGACAGAT TGGGCGATTT TCTGCGAGGC CTACGGGCAG CCGTTGCGCC TCGGCAAATA TGGCTTGAGC GCTTCCGAAA AGGACAAGGA AGTCCTGCTG CGCGCCGTCA GCAGCATTGC GGCCGACTTT GCCGCGACCA TCCCGGAAAG CATGGCGGTC GAGTTCGTAC AGGCTCAGCT CTTCGGCAGC ATCGATCTTT ATGAGCGGCG CGCCGATTGG CTCGACCGTC AGATCTCCAA GCTCGTGCTT GGTCAGACCG CGACGACGGA TGCGCAGGCT GGCGGTTATG CCGTCGGCAA GGTGCATGAC GGCGTGCGTG ACGATATCGA GCGGGCCGAT GCCCGGCAAC TGGCCTCGAC ACTCAACCGC GACCTCGTCA TTCCGCTGGT CGCGCTCAAT TTGGGCTCGC GCAAAAAATA CCCGAAGATC CGCATCGGCC GGCCGGATGA AACCGACGTC AATGATCTCG TTGCCAACGT CGTCAAGTTG GTGCCGCTTG GCCTCAAAGT TGGCATGTCA ACGATGCGTG ACAAGCTGGG TCTGCCGGAT CCGGACGCGG ACGAGGAGCT GCTCGTGCCA AAGGCGGCAA CGCCGGCACC ATCGTCCGAC CAGGAAGATG CTCCACCGCC AAAGGTCGCT GCGCAAAGTC AGATGGCGGG GGGCGATCGG GACGCGATCG ATGTCGCAGC TGCCAAGATC GCGGCCGAGG ATTGGAGGGA AATGACCCCG CCTGTCGTCG ATGGTTTGGC CGATGCCTTG AGCAAGGTGA CGACGCTGGA GGAAACCCAG GCGCTCCTGG CAGCCCAGGT GAGCGCCATG GGCGTCAACG CTTTTGTCGA GCAGCTCGCG CGCGCCGCTT TCTCAGCCAG GATTTCGGGC GAGGCAGATG AGGCGCTCTC ATGA
|
Protein sequence | MAQILDQYGN PISSGQIRQE QAAPTVTGVR RPVGNHQAPG LTPPKLARIL RESIDGDPER YLELAEDMEE RNEHYAGVLG VRKRQVAGLE ITVEAASDSA DDVAAADLVR DVIGRDDLED ELFDILDAVG KGFSATEIIW DTSEGQWTIE ALKWRDPRWF VFDRDDGETL RLRGAAGDED LWPAKWIVHK AKIKSGLPIR GGLARSAAWA YLFKTFTATD WAIFCEAYGQ PLRLGKYGLS ASEKDKEVLL RAVSSIAADF AATIPESMAV EFVQAQLFGS IDLYERRADW LDRQISKLVL GQTATTDAQA GGYAVGKVHD GVRDDIERAD ARQLASTLNR DLVIPLVALN LGSRKKYPKI RIGRPDETDV NDLVANVVKL VPLGLKVGMS TMRDKLGLPD PDADEELLVP KAATPAPSSD QEDAPPPKVA AQSQMAGGDR DAIDVAAAKI AAEDWREMTP PVVDGLADAL SKVTTLEETQ ALLAAQVSAM GVNAFVEQLA RAAFSARISG EADEALS
|
| |