Gene Avi_3047 details

Gene Information

Locus tag: Avi_3047
Symbol:
ID: 7388613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 2536851
End bp: 2538404
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 62%
IMG OID: 643652018
Product: hypothetical protein
Protein accession: YP_002550202
Protein GI: 222149245
COG category: [S] Function unknown
COG ID: [COG4383] Mu-like prophage protein gp29
TIGRFAM ID:
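
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is an untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2536851, 2538404)
print(length_bp)                  # 1554
print(protein_length(length_bp))  # 517
```

Both results agree with the Gene Length (1554 bp) and Protein Length (517 aa) fields of this record.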


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.88762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCAGA TCCTCGACCA ATACGGCAAT CCGATCAGCA GCGGCCAGAT CAGACAAGAG 
CAGGCGGCAC CTACGGTGAC CGGTGTTCGC CGTCCTGTCG GCAATCATCA GGCACCGGGG
CTGACACCGC CGAAGCTGGC GCGGATCCTC AGGGAGTCGA TCGACGGCGA CCCAGAGCGC
TATCTCGAGC TTGCCGAAGA CATGGAGGAA CGCAACGAGC ATTATGCCGG CGTGCTTGGC
GTTCGAAAGC GCCAGGTCGC GGGCCTGGAG ATTACGGTCG AGGCCGCAAG CGATAGCGCC
GATGATGTCG CAGCGGCCGA TCTGGTGCGT GACGTCATCG GCCGCGACGA TCTTGAAGAC
GAGCTGTTCG ATATACTCGA TGCGGTCGGT AAAGGCTTTT CCGCCACCGA AATCATTTGG
GATACATCCG AGGGCCAGTG GACGATCGAG GCACTGAAGT GGCGCGATCC GCGCTGGTTC
GTGTTTGATC GCGACGATGG TGAGACACTT CGGCTGCGTG GTGCCGCAGG CGATGAGGAT
CTCTGGCCGG CGAAGTGGAT CGTGCACAAA GCCAAGATCA AGTCCGGCCT TCCGATCCGA
GGCGGCTTGG CGCGCTCGGC GGCCTGGGCG TATCTGTTCA AGACCTTCAC GGCGACAGAT
TGGGCGATTT TCTGCGAGGC CTACGGGCAG CCGTTGCGCC TCGGCAAATA TGGCTTGAGC
GCTTCCGAAA AGGACAAGGA AGTCCTGCTG CGCGCCGTCA GCAGCATTGC GGCCGACTTT
GCCGCGACCA TCCCGGAAAG CATGGCGGTC GAGTTCGTAC AGGCTCAGCT CTTCGGCAGC
ATCGATCTTT ATGAGCGGCG CGCCGATTGG CTCGACCGTC AGATCTCCAA GCTCGTGCTT
GGTCAGACCG CGACGACGGA TGCGCAGGCT GGCGGTTATG CCGTCGGCAA GGTGCATGAC
GGCGTGCGTG ACGATATCGA GCGGGCCGAT GCCCGGCAAC TGGCCTCGAC ACTCAACCGC
GACCTCGTCA TTCCGCTGGT CGCGCTCAAT TTGGGCTCGC GCAAAAAATA CCCGAAGATC
CGCATCGGCC GGCCGGATGA AACCGACGTC AATGATCTCG TTGCCAACGT CGTCAAGTTG
GTGCCGCTTG GCCTCAAAGT TGGCATGTCA ACGATGCGTG ACAAGCTGGG TCTGCCGGAT
CCGGACGCGG ACGAGGAGCT GCTCGTGCCA AAGGCGGCAA CGCCGGCACC ATCGTCCGAC
CAGGAAGATG CTCCACCGCC AAAGGTCGCT GCGCAAAGTC AGATGGCGGG GGGCGATCGG
GACGCGATCG ATGTCGCAGC TGCCAAGATC GCGGCCGAGG ATTGGAGGGA AATGACCCCG
CCTGTCGTCG ATGGTTTGGC CGATGCCTTG AGCAAGGTGA CGACGCTGGA GGAAACCCAG
GCGCTCCTGG CAGCCCAGGT GAGCGCCATG GGCGTCAACG CTTTTGTCGA GCAGCTCGCG
CGCGCCGCTT TCTCAGCCAG GATTTCGGGC GAGGCAGATG AGGCGCTCTC ATGA
 
Protein sequence
MAQILDQYGN PISSGQIRQE QAAPTVTGVR RPVGNHQAPG LTPPKLARIL RESIDGDPER 
YLELAEDMEE RNEHYAGVLG VRKRQVAGLE ITVEAASDSA DDVAAADLVR DVIGRDDLED
ELFDILDAVG KGFSATEIIW DTSEGQWTIE ALKWRDPRWF VFDRDDGETL RLRGAAGDED
LWPAKWIVHK AKIKSGLPIR GGLARSAAWA YLFKTFTATD WAIFCEAYGQ PLRLGKYGLS
ASEKDKEVLL RAVSSIAADF AATIPESMAV EFVQAQLFGS IDLYERRADW LDRQISKLVL
GQTATTDAQA GGYAVGKVHD GVRDDIERAD ARQLASTLNR DLVIPLVALN LGSRKKYPKI
RIGRPDETDV NDLVANVVKL VPLGLKVGMS TMRDKLGLPD PDADEELLVP KAATPAPSSD
QEDAPPPKVA AQSQMAGGDR DAIDVAAAKI AAEDWREMTP PVVDGLADAL SKVTTLEETQ
ALLAAQVSAM GVNAFVEQLA RAAFSARISG EADEALS
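
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline used to generate this record: it assumes the sequence is pasted as space-separated blocks of ten bases as shown above, and it uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in which start codons it permits; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in T, C, A, G order; identical for the standard code
# and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGGCCCAGA TCCTCGACCA ATACGGCAAT"
print(translate(first_blocks))  # MAQILDQYGN
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the whole protein sequence and the GC content field to be checked in the same way.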