Gene Avi_0703 details


Gene Information

Locus tag: Avi_0703
Symbol:
ID: 7388732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011989
Strand:
Start bp: 604211
End bp: 607426
Gene Length: 3216 bp
Protein Length: 1071 aa
Translation table: 11
GC content: 51%
IMG OID: 643650296
Product: phosphohydrolase protein
Protein accession: YP_002548506
Protein GI: 222147549
COG category: [R] General function prediction only
COG ID: [COG1409] Predicted phosphohydrolases
TIGRFAM ID:
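
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 604211
end_bp = 607426
gene_length_bp = 3216
protein_length_aa = 1071

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as an amino acid.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 607426 - 604211 + 1 = 3216 bp, and 3216 / 3 - 1 = 1071 aa.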


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.159768
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAGC TCGCGCATAT ATCGCAACTC GAATTATTCG ACGGACAAAT CGTGACCGCT 
CCAGACAGAA TTACCATTCT CCATGTCAGT GATTTCCATT TCAACGTGCG CAAAAAGCGT
GAACAGGAAA TCGTAGTTAA GGCGTTGCTT GACGACATTT CCAAACTCTG CATCGGCCAT
AGAAAACCCG ACGTCATTAT TTTCAGCGGC GATCTTACAC AGGGGCCGCC TGGCGACACC
CATGCAGAAG CATACGATTT TCTTCTGGAA CCACTACAGA AGGCGACAAA CGTCTCGAGC
GAGAGGATGT TCATCGTTGC GGGCAATCAC GATGTCGAGC GAGACGCCGT CGCTAAGTTC
CTACCTGAAC ATATCAAATG GCGATCGACT TCGAACGACA TGGCCGCCAT GAATACCGCG
TTCGAGAGTG GTGAGTTCGA ACCGGCATAT CGCGCAAAGT TCGCCAACTA CCTTGATCTC
GAAGACTACA TGGCGAAGAG CAGCACCGTG TTCGCGAACA TTTTTTGCTC GGTCTACCAT
ATTGACGCGC TGAACACTGA CATCATTGTT TTCAACTCGT CAGTTCTCTC GACTGGCGGT
CTCGATAAGG ATGATCGTGA CGAAGGGCTG CTGACGATAC CGGAATACGC AATTCGCGAC
GCGCTATCTT ATTTGAAGCC TGGAACGTTT CGGATCTTCA CCACCCATCA CCCGCTGAAC
GCTTTCACGG AGACTGGTGC CAGCTACCTC TCGAAGGAGA TACAACGAAA TGCCAATATT
CACCTGTTTG GCCATATGCA TGATCCGTCG GGAGCACAGG TCGTAGGCTT CGACGGGACC
GTAATAACGA ACCAAGCAGG CGCGGTCTTC ACGCACCGCA CTGACTGGTA TATCGGGTAT
GCATTGCTTT GCGTGGATCG AGCCAAAGGG TACCACGAGA CAATACTGAA GAGCTATTTC
CCCGAACGCT CCAAGTTCGA TGATGCAATC GATCGTGTGG CAGACGGTCG CTTCTATAAC
TCGCAGGAGG CGAGACAGTA TTGGCGAGCG CTGTCGAACC CCGTTGATGA CAACGCCTTC
CGCAAGCAAT TGTCCGGTCA GGTCTTTGAA GACCTGATCA AGGAGTGGTC GGAGTTGTCG
CTAATCAATC AAGCTGGACG CCAGCACTTC GTAGCGCCGC GCCTCCACAA GATCGACACG
ACGGCACCCA AGGAAACGCG CACCCGCCTA GACACACTGC TTCCATTCAA ATCACTCACC
GCCGACGCGG GTAACATTAT CCTGTATAGT CCTCAGGAAT ATGGCCGCAC AACGATTCTT
CGCGAAATCC AATACGAGTT GCTCGCCACT TCGCATACAA TCGAGCAGCC ACGTCTTCCA
ATCATGATCG GCTTCGATCA GATCAGCCAA AATCCAGCCA ATCTAGAAAG GCTTGTACGA
AGTCGGTCTC TTATGGACCC ACAGGCATTC GGCACCGAGA GCCTGTTGAA GTTGGGTCTG
GTCTGTTTGC TTATTGATGA CGTCGTCTTT GACGACCACA AACGCATGGG AATTTTACGG
GCATTTATCT CAGCCTATCC CCAATGCCGA TACGTTTTCA CCAGTCTGAA GACATCGGCA
GCCCCCTACG GCGCTCATGT CGTTCCGGAA ACGCCGATCC GTTTCGAATT TGTGGAACTC
TGCGAACTCA AACGGAGGGA GATGCGAGAA CTCATCCGCC TCAAGGTACA AGATGCCGCT
GCGGTGGAAG TCATCCTCGA CCGCCTTCAC TCGGAGATAT CAGAGATCAA TCTGCCCTTC
ACCGCTGCCA ATGGCACGAT CTTAATGACG ATCTACGAGC AAAACAGCAA TTTTCAGCCA
ATTAATCGAT CCGTAATGAT TGAACAATTC GTAGATGCAA CTTTGCGCAA GAGTGCTGCG
GAACAAACGC GACGCGAAAC GTTTGACTAC TCCAACAAGA CATCGCTGCT CGCGCACATA
GCGGGCTGGA TGGCAAAGCA AAATTGCTAC GAACCCGCCC AGGAAGCGCT GCGAGAAGAG
ATGAAGGCTT TTGTTTCCAA CATCGGTCTT GTGGCCCCTC TTGATGAAAT TATGGCCGAA
TTCTTCGCAG CGAAGATAAT GCGGAAGATG CCGGATAACA GAGTATCGTT TCGGTATCGC
GCGGTCTTGG AATACTTCAT CGCCACTCAG ATTATAAAAG ACGCCGATTT CAAAGATTGG
GTATTGGATG AGACCCGTTA CCTAACTTAC ATCAACGAAC TGCACTATTA CGCAGGCCTT
GTCAGGAACG ACGCCCGGAT TGCTAACCTA CTTCATGAAC GGTTCAACGC AATATTAGAC
GACAATCAGG AACTAGCGCC GTTCGACCCC TCAGACATCG AAACCGTAAA GCTACCTCGG
AAGGGCAGTA GTGAATCCCT TGCACAACTT TCGTCACATG TCCTAGGGCA ACCCCTTACG
AAGGAAGAGA AAGACGCGGC TCTTGATGCG GACATCCCTC GTGACGTGGA AGAACGTCAA
ACTGTTTTCC GTCCAAGGAT TGAGCATCCC GGACATCGTC TACTGGTAGG TCTGTTTCTG
TTCTCTGGCA TCATTAAAAA CATGGAGCAG ATCAGCGCTA CAGATAAGAG TAAACACCTC
AGACTCGCTT GGAAAGGTTG GGCTATGCTT CTTAGCGCAT CGCTCGCGAT GGTTGAAGAA
CTGGCCAAGA AACGACAGAT GCGCATAAAC GGGGTCTTGT ACGAGATACA CGCCCCTCGG
TCGATGACGA ACGAAGAGCT TGGTCGCCGG ATCGCGATCA ATATGCCTAC GAACATTACT
AGGTTCATCG CGGTCTCAAT GGGGACGGAA AAACTACAGC TACAACTGAC GGAACCAACA
CTGGAGGATG CTGACGATCC ACTCGTGTAT GAGTTCTTCC GATCGTGCCT AATCGCGGAA
TTGGGTTTGA GCGTTACCCC CAGCACGGTC AGAAATGCTC TGAAGCGGCT GGCGAAAAGC
GATTATCTCC TAGAGGCGCT TGCTTGGAAG CTAGGAGAGC TGCGGCGGAT GGATCAGATA
TCTCAGCATC ATTTCGAAGC TGTTCAAGCT GACCTATCGG GCACTATCGT CCGCCTCAAC
GGCGGCGACG AATCCAAGGA TAAGAGCCGT CAAATCGAAC AGTATAAAAG GGAAGGCATA
TTGCTAAAGA TGCAACGTCA GAAGGACGAC GCATGA
 
Protein sequence
MEKLAHISQL ELFDGQIVTA PDRITILHVS DFHFNVRKKR EQEIVVKALL DDISKLCIGH 
RKPDVIIFSG DLTQGPPGDT HAEAYDFLLE PLQKATNVSS ERMFIVAGNH DVERDAVAKF
LPEHIKWRST SNDMAAMNTA FESGEFEPAY RAKFANYLDL EDYMAKSSTV FANIFCSVYH
IDALNTDIIV FNSSVLSTGG LDKDDRDEGL LTIPEYAIRD ALSYLKPGTF RIFTTHHPLN
AFTETGASYL SKEIQRNANI HLFGHMHDPS GAQVVGFDGT VITNQAGAVF THRTDWYIGY
ALLCVDRAKG YHETILKSYF PERSKFDDAI DRVADGRFYN SQEARQYWRA LSNPVDDNAF
RKQLSGQVFE DLIKEWSELS LINQAGRQHF VAPRLHKIDT TAPKETRTRL DTLLPFKSLT
ADAGNIILYS PQEYGRTTIL REIQYELLAT SHTIEQPRLP IMIGFDQISQ NPANLERLVR
SRSLMDPQAF GTESLLKLGL VCLLIDDVVF DDHKRMGILR AFISAYPQCR YVFTSLKTSA
APYGAHVVPE TPIRFEFVEL CELKRREMRE LIRLKVQDAA AVEVILDRLH SEISEINLPF
TAANGTILMT IYEQNSNFQP INRSVMIEQF VDATLRKSAA EQTRRETFDY SNKTSLLAHI
AGWMAKQNCY EPAQEALREE MKAFVSNIGL VAPLDEIMAE FFAAKIMRKM PDNRVSFRYR
AVLEYFIATQ IIKDADFKDW VLDETRYLTY INELHYYAGL VRNDARIANL LHERFNAILD
DNQELAPFDP SDIETVKLPR KGSSESLAQL SSHVLGQPLT KEEKDAALDA DIPRDVEERQ
TVFRPRIEHP GHRLLVGLFL FSGIIKNMEQ ISATDKSKHL RLAWKGWAML LSASLAMVEE
LAKKRQMRIN GVLYEIHAPR SMTNEELGRR IAINMPTNIT RFIAVSMGTE KLQLQLTEPT
LEDADDPLVY EFFRSCLIAE LGLSVTPSTV RNALKRLAKS DYLLEALAWK LGELRRMDQI
SQHHFEAVQA DLSGTIVRLN GGDESKDKSR QIEQYKREGI LLKMQRQKDD A
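
The sequences above are printed in space-separated groups of ten, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling, and the sample uses only the first line of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, used here as a small sample only.
sample = "ATGGAAAAGC TCGCGCATAT ATCGCAACTC GAATTATTCG ACGGACAAAT CGTGACCGCT"
dna = clean_sequence(sample)

print(len(dna))
print(round(gc_percent(dna), 1))
```

To check the reported GC content, pass the complete gene sequence block to `clean_sequence` instead of the one-line sample.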