Gene Avi_2984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_2984 
Symbol 
ID7386150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp2495825 
End bp2499160 
Gene Length3336 bp 
Protein Length1111 aa 
Translation table11 
GC content61% 
IMG OID643651978 
Producttail fiber protein 
Protein accessionYP_002550162 
Protein GI222149205 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTC CTGTTTTGGC CGTACCGTTT CTCGATCCTT CAGTCGGACG CATTTCCAAA 
GAACTGCCCG AAGGCCTGAC CATTGCCGAG ATCATCGGCG TGACCATGCC GGGTCTGTCC
GATCATGATT TGGCGCTGCT GCGGGTGGTG CTGGTCAGCG ATCGCGGCTC TGCGGTGATC
GATCCGCGCT ACTGGCACCG CGTCAGGCCC TATGCCGGTG TGCGTGTCGT CATCCGCCTT
GTGCCGGGCA AGAGCGCCCT TCGGTCTGTC CTGACCGTTG TCGTGGCCAT TGCAGCTGCT
GCGATCGGCC AGTATTGGGC GACCGCCATG GCGCTTGGCA GTGCAACGGC CACCACGGTC
GTTGCGGCCG GCATCACGCT TGGCCTGACC GTGCTTGGCA ATCTGCTGAT CAACGCGCTC
ATTCCGCCCG CCAGCAACAA GAAGGACAAG ACCACCTATT ACATCAACGG CTGGCGCAAC
AATCTCGATC CGGACGGCGT CATTCCAGAA GTCTACGGCA AGATCCGGTT TGCACCACCG
TTCGCCGCCA CCAGCTACAC GGAAATCGTC GGCAATATCC AATATCTGCG CTCGATCTTT
CTGGTCGGCT ATGGCGGCGA CTATGGCGTG GCGCTCTCTG AGTTCTGGAT CGGCGACACC
AGTATCGACG AATATGATGA GCTGACGATC GAGACCCACG AAGGTCTCGC CAGTGATGGT
ACCTTCACGC TCTATTATCG GCAGGTCTAT GAACTGTCCC TCGGCGTCGA GCTGACACGC
GAAAGGCCCC GCAACGATCA GGGCAAGGTG ATCAGCGGCG CGGCGAAGGA AGATCCGGTC
GTGCGCACCA CGGGTGCCGA TGCCTCGGCA GGTTCGGTGA TCATCGGCTT TCCGGCTGGC
CTTGGCCGTG TCGATGACGA AGGCAACAAG AAGAACCTGT CGGTGCAGAT CCGCATCCGG
CAGAAGCCCG CCAATGCCGC CGACGATCAA TATGTCGTGG TCACCACGAT GACGATCACC
AGCCAGAAGC TCGAAGCCTT CTATCGGCAA TACACATGGT CTTTCGCGAC ACGCGGCCGA
TACGACATCG AAGTCACGCG GATGACGGAC GAGCATACCA AGTCGAACTA CCAGAGCCGC
ACCACCTGGG TGGCCTTGCA GACGATCCGG CCGGAATATC CGATCGACTT TCCCTATCCG
CTGGCGATGA TTTCCATGCG GGTCAAGGCG ACTTATCAGC TCAATGGCCA GCTCGATAAT
TTCAACATCA TCGCATCGCG CCGCTGCCTG GATTGGGATG CCGCGACCGG CACATGGATC
GGTCGAGAGA CCAACAACCC GGCCTCACTC TATCGCTACT GCCTGCAATC GAAATCCAAT
CCCAAACCGG TTGCTGACAG CGAGATCGAT CTCGATGCGC TGGCCGACTG GCACGTCTTC
TGCGTATCCA AAGGCCTCGA ATACAATGCG GTCCATGATG ATGACCGCAC GCTGCGGGAG
CGCCTGGACG ATATCGCCGG GGCTGGCCGG GCGCGGTCGC GCTATGACGG TGTGCGCTGG
AGTGTGATCG TCGATCGGCC GCAGGATCTG GTCATCGACC ATATCAACCC GCGCAACTCC
TCGAATTTCA AGGCAAGCCG CACCTACTTC GATCCGCCGC ATGGGTTCCG GGTCAAGTTC
TTCGACCAGA CCTATGACTA CAAGCAGAAC GAACGGTTGG TGCCATGGCC GGGCCATTCC
GGACCAATCA CCCTGACGGA AGCCCTGGAA CTGCCCGGCA AGACCAATCC GGACGAGATC
TGGGTCGAAG CGCGCCGGCG CATGTACGAG GCGCTCTACC GGATCGATAT CTATGAGGCC
GTCCAGGACG GGCCGATCAG CGTCGCCACA CGCGGCGATC TGGTCATGGC CTCCTATGAC
GTTCTGGAGC GCACACAGGT CGCAGCCCGC GTCCTCGATG TCATTGGCCG CACCATCGAG
CTGGACAGCG AAGTCGAGAT GACCTCCGGC CTGACCTATG GTCTGCGGTT TCGGCACTTT
GCGGACGAGG ACGACACGAT CGGCGTCAGC GTGTTGGTGA CGTTGCTCAC CGTTGTCGGC
ACTGGCAAGA CCGTCGTGAT GGCCGATCAG AACCCCGACA TCGTGCCGGA GACCGGAACG
CTGGTGCATG TCGGCTTGCT GACCTCCGAA AGCCTGCCGA TGATCGTCAC GCGGGTCGAG
GCCGGGGAAG ACATGTCGTC GCATCTGCGC CTGGTCAACG CAGCAACGAT CATTGACGAG
CTGACCGACG AAGAGGTGGC ACCGGCATGG TCCGGCCGCG CGGGGGCGGA TGTCGAAACA
TCCAGCAGCG CCCCTCCAAC GCCGACCATC ACGTCGATCG ACACCGGTGT TGTCGGAACG
GAGATCTCTG GCGGATTGAG CGTGTCCGCC TCACCGGGTG TTGGGAATGT GGTGACGGTG
GCCTATCGGC TCCAGCATCG CAAATCCGGT ACGACCGTAT GGACGCCAAT TGATTTTGGC
GTGGGCGATG GTGCGGTGCT GATCACATCC TATGTGACCG GCGATGTTGT CCAGGTGCGG
GTGGCGGCAC TTGGCGATAC CGGCTTGATC AGTGCCTTTT CATTGCCCGT CACGGTGACG
ATCGGCGCGG ATGATGGTGC CACGCCGGCA CAATTGCCGT CCGGCAATAT CAATGTCGTG
GCGATCCTGG GCGGGGCAAC CATCGCCTTC CAGACCACGG ATGATGCGGC AACATCTGCC
ATTCAGATCT ATTGCTCGAC CGTCAACGAC CTCGAAACCA CAACGGATGC GATCGGTTCG
CCAATCGCAG TCGAGGCATC GCGATCCTAT ACCGTTGCGG TGGGTGACGC GACCCGCTCG
AACATGCTGG TGAATGGTGG CTTTGACAGC TCCAGCAATT GGACACTTGG CGACGGCTGG
ATGGTTTCAT CCGGCGCGGC GGTTCACAGC TCCGGGACGG CGAGCAATAT CAGCCAGGCC
GTGACGTTGA CGGCAGGTGC CACCTATCGC CTCAGCTATG ATTTGACCCG CTCGGCCGGC
TCGATCCAGC CAAAGCTGAC GGGCGGAACG ACGGTGTTTG CCGGCAACAG ATCCGCTTCC
GCGACGATCC GCGAGACCGT GCAGGCATTG AGCGGCAATA CCGCGCTCGC GCTTGCCGCC
ACCGACGCCT TTTCCGGACA GGTCGATAAC CTCGTCCTCT ATCTCGAAAC ATCCACCTGT
CTGCCGCAAG GCACCAATTA TCTGTGGCTG GAGCCACGGA ACGCGAACGG CGTGAGTGGA
CCGATCACCG GTCCCTTCAC CATCTCGGTT CGATAG
 
Protein sequence
MTIPVLAVPF LDPSVGRISK ELPEGLTIAE IIGVTMPGLS DHDLALLRVV LVSDRGSAVI 
DPRYWHRVRP YAGVRVVIRL VPGKSALRSV LTVVVAIAAA AIGQYWATAM ALGSATATTV
VAAGITLGLT VLGNLLINAL IPPASNKKDK TTYYINGWRN NLDPDGVIPE VYGKIRFAPP
FAATSYTEIV GNIQYLRSIF LVGYGGDYGV ALSEFWIGDT SIDEYDELTI ETHEGLASDG
TFTLYYRQVY ELSLGVELTR ERPRNDQGKV ISGAAKEDPV VRTTGADASA GSVIIGFPAG
LGRVDDEGNK KNLSVQIRIR QKPANAADDQ YVVVTTMTIT SQKLEAFYRQ YTWSFATRGR
YDIEVTRMTD EHTKSNYQSR TTWVALQTIR PEYPIDFPYP LAMISMRVKA TYQLNGQLDN
FNIIASRRCL DWDAATGTWI GRETNNPASL YRYCLQSKSN PKPVADSEID LDALADWHVF
CVSKGLEYNA VHDDDRTLRE RLDDIAGAGR ARSRYDGVRW SVIVDRPQDL VIDHINPRNS
SNFKASRTYF DPPHGFRVKF FDQTYDYKQN ERLVPWPGHS GPITLTEALE LPGKTNPDEI
WVEARRRMYE ALYRIDIYEA VQDGPISVAT RGDLVMASYD VLERTQVAAR VLDVIGRTIE
LDSEVEMTSG LTYGLRFRHF ADEDDTIGVS VLVTLLTVVG TGKTVVMADQ NPDIVPETGT
LVHVGLLTSE SLPMIVTRVE AGEDMSSHLR LVNAATIIDE LTDEEVAPAW SGRAGADVET
SSSAPPTPTI TSIDTGVVGT EISGGLSVSA SPGVGNVVTV AYRLQHRKSG TTVWTPIDFG
VGDGAVLITS YVTGDVVQVR VAALGDTGLI SAFSLPVTVT IGADDGATPA QLPSGNINVV
AILGGATIAF QTTDDAATSA IQIYCSTVND LETTTDAIGS PIAVEASRSY TVAVGDATRS
NMLVNGGFDS SSNWTLGDGW MVSSGAAVHS SGTASNISQA VTLTAGATYR LSYDLTRSAG
SIQPKLTGGT TVFAGNRSAS ATIRETVQAL SGNTALALAA TDAFSGQVDN LVLYLETSTC
LPQGTNYLWL EPRNANGVSG PITGPFTISV R