Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_0703 |
Symbol | |
ID | 7388732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 604211 |
End bp | 607426 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643650296 |
Product | phosphohydrolase protein |
Protein accession | YP_002548506 |
Protein GI | 222147549 |
COG category | [R] General function prediction only |
COG ID | [COG1409] Predicted phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.159768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGC TCGCGCATAT ATCGCAACTC GAATTATTCG ACGGACAAAT CGTGACCGCT CCAGACAGAA TTACCATTCT CCATGTCAGT GATTTCCATT TCAACGTGCG CAAAAAGCGT GAACAGGAAA TCGTAGTTAA GGCGTTGCTT GACGACATTT CCAAACTCTG CATCGGCCAT AGAAAACCCG ACGTCATTAT TTTCAGCGGC GATCTTACAC AGGGGCCGCC TGGCGACACC CATGCAGAAG CATACGATTT TCTTCTGGAA CCACTACAGA AGGCGACAAA CGTCTCGAGC GAGAGGATGT TCATCGTTGC GGGCAATCAC GATGTCGAGC GAGACGCCGT CGCTAAGTTC CTACCTGAAC ATATCAAATG GCGATCGACT TCGAACGACA TGGCCGCCAT GAATACCGCG TTCGAGAGTG GTGAGTTCGA ACCGGCATAT CGCGCAAAGT TCGCCAACTA CCTTGATCTC GAAGACTACA TGGCGAAGAG CAGCACCGTG TTCGCGAACA TTTTTTGCTC GGTCTACCAT ATTGACGCGC TGAACACTGA CATCATTGTT TTCAACTCGT CAGTTCTCTC GACTGGCGGT CTCGATAAGG ATGATCGTGA CGAAGGGCTG CTGACGATAC CGGAATACGC AATTCGCGAC GCGCTATCTT ATTTGAAGCC TGGAACGTTT CGGATCTTCA CCACCCATCA CCCGCTGAAC GCTTTCACGG AGACTGGTGC CAGCTACCTC TCGAAGGAGA TACAACGAAA TGCCAATATT CACCTGTTTG GCCATATGCA TGATCCGTCG GGAGCACAGG TCGTAGGCTT CGACGGGACC GTAATAACGA ACCAAGCAGG CGCGGTCTTC ACGCACCGCA CTGACTGGTA TATCGGGTAT GCATTGCTTT GCGTGGATCG AGCCAAAGGG TACCACGAGA CAATACTGAA GAGCTATTTC CCCGAACGCT CCAAGTTCGA TGATGCAATC GATCGTGTGG CAGACGGTCG CTTCTATAAC TCGCAGGAGG CGAGACAGTA TTGGCGAGCG CTGTCGAACC CCGTTGATGA CAACGCCTTC CGCAAGCAAT TGTCCGGTCA GGTCTTTGAA GACCTGATCA AGGAGTGGTC GGAGTTGTCG CTAATCAATC AAGCTGGACG CCAGCACTTC GTAGCGCCGC GCCTCCACAA GATCGACACG ACGGCACCCA AGGAAACGCG CACCCGCCTA GACACACTGC TTCCATTCAA ATCACTCACC GCCGACGCGG GTAACATTAT CCTGTATAGT CCTCAGGAAT ATGGCCGCAC AACGATTCTT CGCGAAATCC AATACGAGTT GCTCGCCACT TCGCATACAA TCGAGCAGCC ACGTCTTCCA ATCATGATCG GCTTCGATCA GATCAGCCAA AATCCAGCCA ATCTAGAAAG GCTTGTACGA AGTCGGTCTC TTATGGACCC ACAGGCATTC GGCACCGAGA GCCTGTTGAA GTTGGGTCTG GTCTGTTTGC TTATTGATGA CGTCGTCTTT GACGACCACA AACGCATGGG AATTTTACGG GCATTTATCT CAGCCTATCC CCAATGCCGA TACGTTTTCA CCAGTCTGAA GACATCGGCA GCCCCCTACG GCGCTCATGT CGTTCCGGAA ACGCCGATCC GTTTCGAATT TGTGGAACTC TGCGAACTCA AACGGAGGGA GATGCGAGAA CTCATCCGCC TCAAGGTACA AGATGCCGCT GCGGTGGAAG TCATCCTCGA CCGCCTTCAC TCGGAGATAT CAGAGATCAA TCTGCCCTTC ACCGCTGCCA ATGGCACGAT CTTAATGACG ATCTACGAGC AAAACAGCAA TTTTCAGCCA ATTAATCGAT CCGTAATGAT TGAACAATTC GTAGATGCAA CTTTGCGCAA GAGTGCTGCG GAACAAACGC GACGCGAAAC GTTTGACTAC TCCAACAAGA CATCGCTGCT CGCGCACATA GCGGGCTGGA TGGCAAAGCA AAATTGCTAC GAACCCGCCC AGGAAGCGCT GCGAGAAGAG ATGAAGGCTT TTGTTTCCAA CATCGGTCTT GTGGCCCCTC TTGATGAAAT TATGGCCGAA TTCTTCGCAG CGAAGATAAT GCGGAAGATG CCGGATAACA GAGTATCGTT TCGGTATCGC GCGGTCTTGG AATACTTCAT CGCCACTCAG ATTATAAAAG ACGCCGATTT CAAAGATTGG GTATTGGATG AGACCCGTTA CCTAACTTAC ATCAACGAAC TGCACTATTA CGCAGGCCTT GTCAGGAACG ACGCCCGGAT TGCTAACCTA CTTCATGAAC GGTTCAACGC AATATTAGAC GACAATCAGG AACTAGCGCC GTTCGACCCC TCAGACATCG AAACCGTAAA GCTACCTCGG AAGGGCAGTA GTGAATCCCT TGCACAACTT TCGTCACATG TCCTAGGGCA ACCCCTTACG AAGGAAGAGA AAGACGCGGC TCTTGATGCG GACATCCCTC GTGACGTGGA AGAACGTCAA ACTGTTTTCC GTCCAAGGAT TGAGCATCCC GGACATCGTC TACTGGTAGG TCTGTTTCTG TTCTCTGGCA TCATTAAAAA CATGGAGCAG ATCAGCGCTA CAGATAAGAG TAAACACCTC AGACTCGCTT GGAAAGGTTG GGCTATGCTT CTTAGCGCAT CGCTCGCGAT GGTTGAAGAA CTGGCCAAGA AACGACAGAT GCGCATAAAC GGGGTCTTGT ACGAGATACA CGCCCCTCGG TCGATGACGA ACGAAGAGCT TGGTCGCCGG ATCGCGATCA ATATGCCTAC GAACATTACT AGGTTCATCG CGGTCTCAAT GGGGACGGAA AAACTACAGC TACAACTGAC GGAACCAACA CTGGAGGATG CTGACGATCC ACTCGTGTAT GAGTTCTTCC GATCGTGCCT AATCGCGGAA TTGGGTTTGA GCGTTACCCC CAGCACGGTC AGAAATGCTC TGAAGCGGCT GGCGAAAAGC GATTATCTCC TAGAGGCGCT TGCTTGGAAG CTAGGAGAGC TGCGGCGGAT GGATCAGATA TCTCAGCATC ATTTCGAAGC TGTTCAAGCT GACCTATCGG GCACTATCGT CCGCCTCAAC GGCGGCGACG AATCCAAGGA TAAGAGCCGT CAAATCGAAC AGTATAAAAG GGAAGGCATA TTGCTAAAGA TGCAACGTCA GAAGGACGAC GCATGA
|
Protein sequence | MEKLAHISQL ELFDGQIVTA PDRITILHVS DFHFNVRKKR EQEIVVKALL DDISKLCIGH RKPDVIIFSG DLTQGPPGDT HAEAYDFLLE PLQKATNVSS ERMFIVAGNH DVERDAVAKF LPEHIKWRST SNDMAAMNTA FESGEFEPAY RAKFANYLDL EDYMAKSSTV FANIFCSVYH IDALNTDIIV FNSSVLSTGG LDKDDRDEGL LTIPEYAIRD ALSYLKPGTF RIFTTHHPLN AFTETGASYL SKEIQRNANI HLFGHMHDPS GAQVVGFDGT VITNQAGAVF THRTDWYIGY ALLCVDRAKG YHETILKSYF PERSKFDDAI DRVADGRFYN SQEARQYWRA LSNPVDDNAF RKQLSGQVFE DLIKEWSELS LINQAGRQHF VAPRLHKIDT TAPKETRTRL DTLLPFKSLT ADAGNIILYS PQEYGRTTIL REIQYELLAT SHTIEQPRLP IMIGFDQISQ NPANLERLVR SRSLMDPQAF GTESLLKLGL VCLLIDDVVF DDHKRMGILR AFISAYPQCR YVFTSLKTSA APYGAHVVPE TPIRFEFVEL CELKRREMRE LIRLKVQDAA AVEVILDRLH SEISEINLPF TAANGTILMT IYEQNSNFQP INRSVMIEQF VDATLRKSAA EQTRRETFDY SNKTSLLAHI AGWMAKQNCY EPAQEALREE MKAFVSNIGL VAPLDEIMAE FFAAKIMRKM PDNRVSFRYR AVLEYFIATQ IIKDADFKDW VLDETRYLTY INELHYYAGL VRNDARIANL LHERFNAILD DNQELAPFDP SDIETVKLPR KGSSESLAQL SSHVLGQPLT KEEKDAALDA DIPRDVEERQ TVFRPRIEHP GHRLLVGLFL FSGIIKNMEQ ISATDKSKHL RLAWKGWAML LSASLAMVEE LAKKRQMRIN GVLYEIHAPR SMTNEELGRR IAINMPTNIT RFIAVSMGTE KLQLQLTEPT LEDADDPLVY EFFRSCLIAE LGLSVTPSTV RNALKRLAKS DYLLEALAWK LGELRRMDQI SQHHFEAVQA DLSGTIVRLN GGDESKDKSR QIEQYKREGI LLKMQRQKDD A
|
| |