Gene Information |
Locus tag | Avi_6196 |
Symbol | |
ID | 7381250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | + |
Start bp | 1197356 |
End bp | 1199635 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643649672 |
Product | phage tail protein |
Protein accession | YP_002547896 |
Protein GI | 222107105 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
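The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon, neither of which the record states explicitly. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table for Avi_6196.
start_bp = 1197356
end_bp = 1199635
gene_length_bp = 2280      # "Gene Length" field
protein_length_aa = 759    # "Protein Length" field


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming one terminal stop codon and no frameshift."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Compare the derived values with the fields reported in the record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, both derived values agree with the reported Gene Length and Protein Length fields.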
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTCGG TCGGCAAGAC CTTGTCTGTC GGGTTGACGG CGCCGATTGC GGCCTTTGGC ACCTTGACTG TTAAGACGGC GGGGGATTTT CAGGCGGCGA TGAACCGCGT TGAGGCTGCG ACCGGTGCGA CGGCAGCAGA GATTGCCGAC ATGCAAAAGA TGGCAATCAA GCTAGGTGCG GACACCACCT TTTCCGCCTC TGAGGCCGCC AATGCTATGG AAATGCTGGC GAAGAATGGC TTGACTGCCA GCCAGATCAT GGGTGGGGCG GTGCAGGCCA GCATGAAGCT GGCGGCGGCA TCCGGCGGGG AGCTGGCGGC ATCTGCTGAC CTTGTTACTG ACGTGATGAT GAATTTCGGG AAAAACGCCA AGGATCTGAA TCCGGTTATC GATGGGATAA CCGGCGTTCT GCTGCAATCG AAATTTGGCT TTGACGACTA TCGTCTTGCA ATAGCGCAGG CTGGCGGCGC GGCTGGCAGC CTTGGTGTCT CTTTTGATGA TTTCAATGCA TCAATTGCGG CCACCAGCTC GGCTTTTTCT AGCGGCTCTG ATGCTGGTAC ATCCTTCAAA ACATTTATCA CCTCGCTGGT ACCAAAGTCC AAAACGGCAA GAGCCACCAT GCGTCAGTTG GGCTTGGAGT TTTTCGAGGC TAACGGCTCG ATGAAAGACA TGTCCGCTAT CGCGGAACAG CTAAAAACCA AGATGTCGGG CCTGTCTGAT GAAGGGCTCA TCGAGGCCAT GAATGATCTT TTTGGCGTTG ATGGTATGCG GACGGCCATC ATGCTGATGA AGACCGGGGG CAAGGGTATT GATGAGCTGA AAGCAAAGAT TGCTAAGGCC TCTGCTGCCG ATCAGGCCGC GGCGCGGTTG AAAGGCTTCA ATGGCGAATT GGAGCAGTTG GGCGGCGCCT TCGAAAGTTT GCAGATTGCC ATTGCCAATA GCGGCTTACT CTCGTTGCTG ACGGAAATGG TAAAGTCGCT TGCGGATTGG GTTGCGAAAC TGGCGGAAAC CAACCCGGAA ATCCTGAAAT GGGGAACGGC GGTCGCTGCA CTCGCTGCGG TTCTCGGTCC TGTTGCGGTT GGTATCGGCG CTGTCGTTGC GGTGATTGCC GCTATTGGCG CACCGATTGC GTTGGCGGTG GCTGGTGCCG CAGCATTGGC GGCGGCGGCT GTGGCGGTTT ACACCAATTG GGATACGATC AAGACGCAAT TTCCGACCAT TGGCGCGATT GTCGAGGGCG CAATTGGTGT GATCAGCGCG ACGTTGACAG CGTTGAACGC CAATGCGCAC TCGATTGTTG CCGGTATTGT CGCGCTGTTT ACCGGGGATT TTGCCGGGGC ATTTACCGCC ATACAGCAGA TTGCCCATAA CTTCGCGGAT CTATGGCTCA ATATTGCCGA GGTAATTTTC CCTGGCGCCA AGGCAGCAAT CATTGCTGGG GTGCAGTCTA TCGGCGCATC CATGGCAACC TTTGGATCGC AGATCCTTTC GACATTCGTC AACCTGGGCG CTGAAATGGT GGTGATTGGC GAGCAGATCA TGGCCGGGCT TTGGCAAGGC ATTCAGAACA AGTGGCAATC GGTCAAGGAA AGCGTGACCA GTATTGCCAG TGGCATCAAG AGCACCTTCA CGGAATTCTT TGACATCAAC TCGCCTTCGC GCGTGATGAC CACACTTGGC GAATATATCA CGCAGGGCCT TGGTGATGGT ATCGCCAACG GCAAAGGGCA AGCCGTATCC TCTGCAACGG ATGTCGCCAA TGGTGTTTCC GGCGCATTGT CGAATATCGA CACAGCAGGA 
TCTGGCTTGG CGAAGAACAT GGATAATGCT TTTTCGTCCA TCGGTTCCGG GCTGGCGGAT GCGATCAAAG GCACAAAGAG CTGGGGCGAT GTTGCCAAGG GCATCCTTTC ATCGCTGGCG CAATCGCTTA TCGGCACAAT GGGCGGCGGC GGCGGGGTTG GTGGATCGCT GCTCAAGGGC CTGTTTTCCG GCCTGACCGG CTTTGCCAAT GGCGGCACGA TCATGCCGGG GGGCAATGGT GCGGGGATTG ACAGCCAAGT AGTGGCGTTT CGCAAAAGCC CGACTGAGCA GGTCGATATC CATGACCCGC GCAAAAGCAA GAGTAGTGGC GGCGGTGACC GTTACTACTC TATCGATGCG CGGGGCGCTG ATCAGGGTGC TGTTTCCCGG ATCGAGGCTG CCTTGAAAAA GGTGGATGGT TCGATTGAAA AGCGGGCGGT TGCCGCACAA AACTTTAGCA ATAAGCGGAA ATACATCTGA
|
Protein sequence | MESVGKTLSV GLTAPIAAFG TLTVKTAGDF QAAMNRVEAA TGATAAEIAD MQKMAIKLGA DTTFSASEAA NAMEMLAKNG LTASQIMGGA VQASMKLAAA SGGELAASAD LVTDVMMNFG KNAKDLNPVI DGITGVLLQS KFGFDDYRLA IAQAGGAAGS LGVSFDDFNA SIAATSSAFS SGSDAGTSFK TFITSLVPKS KTARATMRQL GLEFFEANGS MKDMSAIAEQ LKTKMSGLSD EGLIEAMNDL FGVDGMRTAI MLMKTGGKGI DELKAKIAKA SAADQAAARL KGFNGELEQL GGAFESLQIA IANSGLLSLL TEMVKSLADW VAKLAETNPE ILKWGTAVAA LAAVLGPVAV GIGAVVAVIA AIGAPIALAV AGAAALAAAA VAVYTNWDTI KTQFPTIGAI VEGAIGVISA TLTALNANAH SIVAGIVALF TGDFAGAFTA IQQIAHNFAD LWLNIAEVIF PGAKAAIIAG VQSIGASMAT FGSQILSTFV NLGAEMVVIG EQIMAGLWQG IQNKWQSVKE SVTSIASGIK STFTEFFDIN SPSRVMTTLG EYITQGLGDG IANGKGQAVS SATDVANGVS GALSNIDTAG SGLAKNMDNA FSSIGSGLAD AIKGTKSWGD VAKGILSSLA QSLIGTMGGG GGVGGSLLKG LFSGLTGFAN GGTIMPGGNG AGIDSQVVAF RKSPTEQVDI HDPRKSKSSG GGDRYYSIDA RGADQGAVSR IEAALKKVDG SIEKRAVAAQ NFSNKRKYI