Gene Avi_3033 details

Gene Information

Locus tag: Avi_3033
Symbol:
ID: 7388602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 2523430
End bp: 2524461
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 59%
IMG OID: 643652007
Product: hypothetical protein
Protein accession: YP_002550191
Protein GI: 222149234
COG category: [R] General function prediction only
COG ID: [COG2842] Uncharacterized ATPase, putative transposase
TIGRFAM ID:
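
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2523430
end_bp = 2524461
gene_length_bp = 1032
protein_length_aa = 343

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the computed span agrees with the listed gene length, and the codon count minus the stop codon agrees with the listed protein length.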


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00438308
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGC ACATCAACAC AAGTCAGTTT AACGGCGCTT CCTGGGAGCG CCCCGTCCAG 
GCACCTGAGG TTTCTGCAAA CAAGAGTGAT GCCGATATCG AAAAGTGGTG GGAGCTGATT
GACCGCGTTA TTGCCGTTGC CCGGCAGTTC CGGTGGACGA AGGCGGAAGT TACCCGCCGG
TCGGGCATGA AGGAAGGCAC GTTTAGCCAG TGGTTTTCCG GCCGCTACGA AGGGCGGCTG
GACGGTCACA ACACCATGAT TGAACAATGG CTGGATGCCT TGGAAGCCAG TGCCAGCATT
GCGGCGATGA TTCCGCAATC GCCGCCCTTC ATGAAGCTTC GCGGTTCGGC GGAGGTGCTG
GAAACGCTGA CGTGGGCGCA GATTTGCCCC GATCTGGTGA TGATCACGCT GGGCGCTGGC
ATGGGCAAGA CCGCGACATG TGAGTATTTC ACCAACACGC GCCCGCATGT CTATCACGCC
ACCGTTTCTG AGAGCACCAA GACGGTTCAC GGCATGTTGA CGGAGCTGGC CGAGCAGCTC
GCGGTTCAGG AGAACAACCC GGCGCGTCTG GCGCGGGCGA TCGGGACCAA GTTGAAGCGG
ACCGGTGACG GGACGTTGCT GATCGTTGAC GAGGGCCAGC ACCTTAACGA CGAGGCGCTC
AACCAGCTTC GCCATTTCGT GGATGTGTAC AAATGCGGTG TGGCCGTCGT TGGCAACTCG
GAGGTCTATA GCCGGTTTGC CAGCAACAAA AAGGGGCCGA GTTATGCCCA GCTGAAAAGC
CGCATCGGTA AGCGCCTGCA ACGGGTGCAG CCCTATCCGG ATGACTTGCA AACCTACATT
GCCGCCTGGA ATGTAACCGA TCCGGCCTGC ATCAAGTTTC TGATGGGCAT CGGCTTGAAG
GGCGGTGCCT TCCGGCAGAT CGAAAAGACA ATGCGCATGG CCTTGATGGT GGCGCTTGGG
GCAGGGACCG AGGTTGGCTT AAAGGACATT CAGGCCGCCT GGAAGAACCG CGACGTGGAG
GACATGGCAT GA
 
Protein sequence
MNKHINTSQF NGASWERPVQ APEVSANKSD ADIEKWWELI DRVIAVARQF RWTKAEVTRR 
SGMKEGTFSQ WFSGRYEGRL DGHNTMIEQW LDALEASASI AAMIPQSPPF MKLRGSAEVL
ETLTWAQICP DLVMITLGAG MGKTATCEYF TNTRPHVYHA TVSESTKTVH GMLTELAEQL
AVQENNPARL ARAIGTKLKR TGDGTLLIVD EGQHLNDEAL NQLRHFVDVY KCGVAVVGNS
EVYSRFASNK KGPSYAQLKS RIGKRLQRVQ PYPDDLQTYI AAWNVTDPAC IKFLMGIGLK
GGAFRQIEKT MRMALMVALG AGTEVGLKDI QAAWKNRDVE DMA
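
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The following is a minimal sketch, not the method used to produce this record: it assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as starts), and the names `CODON_TABLE` and `translate` are hypothetical helpers. Only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with their one-letter amino acids.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAACAAGC ACATCAACAC AAGTCAGTTT AACGGCGCTT CCTGGGAGCG CCCCGTCCAG"
last_line = "GACATGGCAT GA"

print(translate(first_line))  # MNKHINTSQFNGASWERPVQ
print(translate(last_line))   # DMA
```

The two outputs correspond to the first 20 and the last 3 residues of the protein sequence listed above. Because every full line of the gene sequence holds 60 bases, each line starts in frame and can be translated on its own.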