Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_2984 |
Symbol | |
ID | 7386150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 2495825 |
End bp | 2499160 |
Gene Length | 3336 bp |
Protein Length | 1111 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643651978 |
Product | tail fiber protein |
Protein accession | YP_002550162 |
Protein GI | 222149205 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATTC CTGTTTTGGC CGTACCGTTT CTCGATCCTT CAGTCGGACG CATTTCCAAA GAACTGCCCG AAGGCCTGAC CATTGCCGAG ATCATCGGCG TGACCATGCC GGGTCTGTCC GATCATGATT TGGCGCTGCT GCGGGTGGTG CTGGTCAGCG ATCGCGGCTC TGCGGTGATC GATCCGCGCT ACTGGCACCG CGTCAGGCCC TATGCCGGTG TGCGTGTCGT CATCCGCCTT GTGCCGGGCA AGAGCGCCCT TCGGTCTGTC CTGACCGTTG TCGTGGCCAT TGCAGCTGCT GCGATCGGCC AGTATTGGGC GACCGCCATG GCGCTTGGCA GTGCAACGGC CACCACGGTC GTTGCGGCCG GCATCACGCT TGGCCTGACC GTGCTTGGCA ATCTGCTGAT CAACGCGCTC ATTCCGCCCG CCAGCAACAA GAAGGACAAG ACCACCTATT ACATCAACGG CTGGCGCAAC AATCTCGATC CGGACGGCGT CATTCCAGAA GTCTACGGCA AGATCCGGTT TGCACCACCG TTCGCCGCCA CCAGCTACAC GGAAATCGTC GGCAATATCC AATATCTGCG CTCGATCTTT CTGGTCGGCT ATGGCGGCGA CTATGGCGTG GCGCTCTCTG AGTTCTGGAT CGGCGACACC AGTATCGACG AATATGATGA GCTGACGATC GAGACCCACG AAGGTCTCGC CAGTGATGGT ACCTTCACGC TCTATTATCG GCAGGTCTAT GAACTGTCCC TCGGCGTCGA GCTGACACGC GAAAGGCCCC GCAACGATCA GGGCAAGGTG ATCAGCGGCG CGGCGAAGGA AGATCCGGTC GTGCGCACCA CGGGTGCCGA TGCCTCGGCA GGTTCGGTGA TCATCGGCTT TCCGGCTGGC CTTGGCCGTG TCGATGACGA AGGCAACAAG AAGAACCTGT CGGTGCAGAT CCGCATCCGG CAGAAGCCCG CCAATGCCGC CGACGATCAA TATGTCGTGG TCACCACGAT GACGATCACC AGCCAGAAGC TCGAAGCCTT CTATCGGCAA TACACATGGT CTTTCGCGAC ACGCGGCCGA TACGACATCG AAGTCACGCG GATGACGGAC GAGCATACCA AGTCGAACTA CCAGAGCCGC ACCACCTGGG TGGCCTTGCA GACGATCCGG CCGGAATATC CGATCGACTT TCCCTATCCG CTGGCGATGA TTTCCATGCG GGTCAAGGCG ACTTATCAGC TCAATGGCCA GCTCGATAAT TTCAACATCA TCGCATCGCG CCGCTGCCTG GATTGGGATG CCGCGACCGG CACATGGATC GGTCGAGAGA CCAACAACCC GGCCTCACTC TATCGCTACT GCCTGCAATC GAAATCCAAT CCCAAACCGG TTGCTGACAG CGAGATCGAT CTCGATGCGC TGGCCGACTG GCACGTCTTC TGCGTATCCA AAGGCCTCGA ATACAATGCG GTCCATGATG ATGACCGCAC GCTGCGGGAG CGCCTGGACG ATATCGCCGG GGCTGGCCGG GCGCGGTCGC GCTATGACGG TGTGCGCTGG AGTGTGATCG TCGATCGGCC GCAGGATCTG GTCATCGACC ATATCAACCC GCGCAACTCC TCGAATTTCA AGGCAAGCCG CACCTACTTC GATCCGCCGC ATGGGTTCCG GGTCAAGTTC TTCGACCAGA CCTATGACTA CAAGCAGAAC GAACGGTTGG TGCCATGGCC GGGCCATTCC GGACCAATCA CCCTGACGGA AGCCCTGGAA CTGCCCGGCA AGACCAATCC GGACGAGATC TGGGTCGAAG CGCGCCGGCG CATGTACGAG GCGCTCTACC GGATCGATAT CTATGAGGCC GTCCAGGACG GGCCGATCAG CGTCGCCACA CGCGGCGATC TGGTCATGGC CTCCTATGAC GTTCTGGAGC GCACACAGGT CGCAGCCCGC GTCCTCGATG TCATTGGCCG CACCATCGAG CTGGACAGCG AAGTCGAGAT GACCTCCGGC CTGACCTATG GTCTGCGGTT TCGGCACTTT GCGGACGAGG ACGACACGAT CGGCGTCAGC GTGTTGGTGA CGTTGCTCAC CGTTGTCGGC ACTGGCAAGA CCGTCGTGAT GGCCGATCAG AACCCCGACA TCGTGCCGGA GACCGGAACG CTGGTGCATG TCGGCTTGCT GACCTCCGAA AGCCTGCCGA TGATCGTCAC GCGGGTCGAG GCCGGGGAAG ACATGTCGTC GCATCTGCGC CTGGTCAACG CAGCAACGAT CATTGACGAG CTGACCGACG AAGAGGTGGC ACCGGCATGG TCCGGCCGCG CGGGGGCGGA TGTCGAAACA TCCAGCAGCG CCCCTCCAAC GCCGACCATC ACGTCGATCG ACACCGGTGT TGTCGGAACG GAGATCTCTG GCGGATTGAG CGTGTCCGCC TCACCGGGTG TTGGGAATGT GGTGACGGTG GCCTATCGGC TCCAGCATCG CAAATCCGGT ACGACCGTAT GGACGCCAAT TGATTTTGGC GTGGGCGATG GTGCGGTGCT GATCACATCC TATGTGACCG GCGATGTTGT CCAGGTGCGG GTGGCGGCAC TTGGCGATAC CGGCTTGATC AGTGCCTTTT CATTGCCCGT CACGGTGACG ATCGGCGCGG ATGATGGTGC CACGCCGGCA CAATTGCCGT CCGGCAATAT CAATGTCGTG GCGATCCTGG GCGGGGCAAC CATCGCCTTC CAGACCACGG ATGATGCGGC AACATCTGCC ATTCAGATCT ATTGCTCGAC CGTCAACGAC CTCGAAACCA CAACGGATGC GATCGGTTCG CCAATCGCAG TCGAGGCATC GCGATCCTAT ACCGTTGCGG TGGGTGACGC GACCCGCTCG AACATGCTGG TGAATGGTGG CTTTGACAGC TCCAGCAATT GGACACTTGG CGACGGCTGG ATGGTTTCAT CCGGCGCGGC GGTTCACAGC TCCGGGACGG CGAGCAATAT CAGCCAGGCC GTGACGTTGA CGGCAGGTGC CACCTATCGC CTCAGCTATG ATTTGACCCG CTCGGCCGGC TCGATCCAGC CAAAGCTGAC GGGCGGAACG ACGGTGTTTG CCGGCAACAG ATCCGCTTCC GCGACGATCC GCGAGACCGT GCAGGCATTG AGCGGCAATA CCGCGCTCGC GCTTGCCGCC ACCGACGCCT TTTCCGGACA GGTCGATAAC CTCGTCCTCT ATCTCGAAAC ATCCACCTGT CTGCCGCAAG GCACCAATTA TCTGTGGCTG GAGCCACGGA ACGCGAACGG CGTGAGTGGA CCGATCACCG GTCCCTTCAC CATCTCGGTT CGATAG
|
Protein sequence | MTIPVLAVPF LDPSVGRISK ELPEGLTIAE IIGVTMPGLS DHDLALLRVV LVSDRGSAVI DPRYWHRVRP YAGVRVVIRL VPGKSALRSV LTVVVAIAAA AIGQYWATAM ALGSATATTV VAAGITLGLT VLGNLLINAL IPPASNKKDK TTYYINGWRN NLDPDGVIPE VYGKIRFAPP FAATSYTEIV GNIQYLRSIF LVGYGGDYGV ALSEFWIGDT SIDEYDELTI ETHEGLASDG TFTLYYRQVY ELSLGVELTR ERPRNDQGKV ISGAAKEDPV VRTTGADASA GSVIIGFPAG LGRVDDEGNK KNLSVQIRIR QKPANAADDQ YVVVTTMTIT SQKLEAFYRQ YTWSFATRGR YDIEVTRMTD EHTKSNYQSR TTWVALQTIR PEYPIDFPYP LAMISMRVKA TYQLNGQLDN FNIIASRRCL DWDAATGTWI GRETNNPASL YRYCLQSKSN PKPVADSEID LDALADWHVF CVSKGLEYNA VHDDDRTLRE RLDDIAGAGR ARSRYDGVRW SVIVDRPQDL VIDHINPRNS SNFKASRTYF DPPHGFRVKF FDQTYDYKQN ERLVPWPGHS GPITLTEALE LPGKTNPDEI WVEARRRMYE ALYRIDIYEA VQDGPISVAT RGDLVMASYD VLERTQVAAR VLDVIGRTIE LDSEVEMTSG LTYGLRFRHF ADEDDTIGVS VLVTLLTVVG TGKTVVMADQ NPDIVPETGT LVHVGLLTSE SLPMIVTRVE AGEDMSSHLR LVNAATIIDE LTDEEVAPAW SGRAGADVET SSSAPPTPTI TSIDTGVVGT EISGGLSVSA SPGVGNVVTV AYRLQHRKSG TTVWTPIDFG VGDGAVLITS YVTGDVVQVR VAALGDTGLI SAFSLPVTVT IGADDGATPA QLPSGNINVV AILGGATIAF QTTDDAATSA IQIYCSTVND LETTTDAIGS PIAVEASRSY TVAVGDATRS NMLVNGGFDS SSNWTLGDGW MVSSGAAVHS SGTASNISQA VTLTAGATYR LSYDLTRSAG SIQPKLTGGT TVFAGNRSAS ATIRETVQAL SGNTALALAA TDAFSGQVDN LVLYLETSTC LPQGTNYLWL EPRNANGVSG PITGPFTISV R
|
| |