Gene Information |
Locus tag | Avi_4021 |
Symbol | |
ID | 7387350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 3385594 |
End bp | 3388338 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643652741 |
Product | hypothetical protein |
Protein accession | YP_002550915 |
Protein GI | 222149958 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | [TIGR02302] conserved hypothetical protein TIGR02302 |
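The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3385594
end_bp = 3388338
gene_length_bp = 2745
protein_length_aa = 914


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed to be counted)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # 2745 914
```

Under these assumptions the computed values agree with the reported gene length (2745 bp) and protein length (914 aa).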
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGGAA ACCGGAGACT GCCCATGACC GCTTCCGGCA CGAAAAAGAG CCAAGTTGCT CTAAACGGCG ATCAAAAGCT GGCCCGCAGG CTGGCCGTGA ACCGCTGGGC GGCAAGGCTC GTGCTGTTTT GTGAGCGGTT GCTACCCCTG CTGCTGCTTC CAGCCTCGAT TGCCGCCATT TTCCTGTCGC TGGCCTGGCT CGGCTTTTTT CGCGAGGCAC CCAGCTATCT GCGCTGGCCG CTGGTTTTCC TGTTGGTTTT CGGCTTTCTC GCCTCGCTTT TGCCTCTGGC CCGTCTGCGC TGGCCTCACA TTGGCGAAGC AGACCGGCTG CTGGAAGAGC GCAATCATCT GCCGCATCAA CCCATCGGCG TGCAGGGTGA GGAACCGGCC TTCGACACGC CTTTTGCCCG CGCCCTCTGG CATGAGCACC AACGGCGCAT GGCGACCCGG ATCGCCACGC TCGATGCCGG ACGGCCAAAG CCTGATATCG CCCGCTTTGA CCGCTTCGGC CTGCGCGCCT TGCCCGCTCT GCTTTTGGCG GTAGCCTTTG CCTATTCCGG CTCGAATGGC GCTGGCCGTC TCACCGATGC CTTTCTGCTG CAAGACAGGG ACGACGCCGC CCCTACCCTG CGCATCGATG CCTGGATTAC CCCGCCGGGC TATACGGGCC GCCCCCCTGT TTTCCTCACG GGACGCAATG GCGATGGCAC CCAGGATATC GCCGTTCCCA CCCAATCGGT GGTCACTGTA CGGTTGACCG GCAGCACCGG AGACGAACAG GTAGACTTCA AACCGGCATC CGGGGCAGCG GCAGTCGCTT TCCAAGCGGT CAAGGCCGAC CAGGAAAACA CGCCCGGCCC CTTGGCCACC CCCTCGCCTA GCGTCTCGTC AGCGACGCCA ACGCCCGCCC AGACGGACGG CAGCGCCCGT ACGCTGACCC TGACATTGGG TGAAAGCGGC ACGCTGGATC TTGGTAACAG GCAATGGCCG CTGAAGGTCA TTCCCGACCA TGGCCCAACC ATCGCCTTTG ACGGCATTCC GAAACGTGCG GTCAACGGAG CGCTGGAAAT CGGCTTTACC GCCCGCGACG ATTACGGTCT GCTGGAAGCG CGCGCCGAGA TCGTCCCTGT GGATGCAGAT CCGCAGGCCA GCCCTCTCTA TCCCCTGCCC GAGTTCAAGC TGGACCTGCC AAGCCGCAAT GCCCGCGACA TCAAAAGCAT ATCGAGCCGC AACCTGACCG AACATCCGCT GGCTGGAAAG AAGGTGCGCA TCACCTTGAT TGCCAAGGAC GGATTGGGCC AGGAGGGCCG CAGCCCAGCC CATGAGATGA TCCTGCCGGG CCGCAATTTT TCCGAACCGC TGGCCGCCGC CGTGGCCGAA CAGCGTCAGG TCTTTGCGCT GGACACGCGG CAGATGCCCC GTGCCTTGGC ACTCAACGAA GCGCTGACGC TGCGCGCCGA TGAAACCATT CCGCAGCTTT CCCATTACCT GCTGATTTCC TCGGCCCATG CCCGTATGCA GATGGCCAGC ACGACCGAAG CCTTGAAGGA CACGGCAGAT TATCTCTGGG AAATCGCGCT TGGCATCGAC GATGGCGATG TGTCACTGGC GGAACGCAAG CTGAGCGAGG CCCAGCAGAA GCTGGCGGAA GCGCTGGAGC GAAATGCGCC CGATGCCGAA ATCAAGAAGC TGATGGACGA GGTCCGTCAG GCAATGCAGG ACTATATGAA GGCTCTTGCA GAGCGCCAGC AGCCGCAAAA CGGCCAGCAG AACCAGCAAA GCGCTCAGAA AATGCTGACC 
CAGCGCGATC TGGAAAACAT GATGAACCAG ATCGAGAACC TGGCCCGTTC CGGCAACAAG GATGCCGCCC GCCAGATGTT GCAGGAGATG CAACGGATGA TGAACAATCT CCAGGCCGGA CGTCCACAGC GCTCCAATCC GCAGCAGCAA CAGCAGACCA GCGAAGCGCG CAAGCAGATC GACAAGCTCG GCCAGATCAT GCAGGACCAG CAGAAGCTGA TGGACCAGAC TTTCAAACTG GATCAGGAAT TACAGAGCCG GTCACAGATG GGCGATGACC TTCAGCCGGA AGACGGTGAC GGCATGAGCC AGGACAATCC TCAGGCTGAA CCAACGCCCG ATGGCGATCA AAACCAGCAA CAGGGCCAAA ACCCGGACCA GAACCAGAAT AAAGACAGCG CCGGAAAAAA TCCCTCCTCA CCGGATCAGA TGACCGCCGA GCAATTGCGT GAAGCGCTCA AACAATTGCG CCAGCAGCAG GATGCGCTTG GCAAGCAGCT GAAGGGCGTC CAGGACGGAC TTGGCAAGCT CGGCATCAAA CCGGGGGAGA ATTTCGGCCA GGCAGGCCGG GAAATGCAGG GCGCGGGCGA AGCACTCGGC AAAAGCCAAG GAGACCGGGC GGTGCAGGGT CAAGGTCGGG CGCTGGAAGC CCTGCGCCAA GGGGCGCGTG ATATGATGAA CCAGATGATG CAGGCCATGC AGCAAGGCCA GGGGCAAGGT CAGGGCCAGG GGCAAGGCAT GGCGGAAGGC AATCAGGGCG GTCGCGACCC GCTGGGGCGG CCACGGTCAA CCACCGGACC GGATTTTGGC GAGCGCGTCA AAGTCCCCGA CGAAATCGAC GTGCAGCGCG CCCGCGAAAT CCTTGACGCG ATCCGCAACA AGCTGGGCAA TAATGCAAGC CCCGAAGTAG AGCGTCGCTA TCTGGAACGG TTGCTGGACA TGTAA
|
Protein sequence | MTGNRRLPMT ASGTKKSQVA LNGDQKLARR LAVNRWAARL VLFCERLLPL LLLPASIAAI FLSLAWLGFF REAPSYLRWP LVFLLVFGFL ASLLPLARLR WPHIGEADRL LEERNHLPHQ PIGVQGEEPA FDTPFARALW HEHQRRMATR IATLDAGRPK PDIARFDRFG LRALPALLLA VAFAYSGSNG AGRLTDAFLL QDRDDAAPTL RIDAWITPPG YTGRPPVFLT GRNGDGTQDI AVPTQSVVTV RLTGSTGDEQ VDFKPASGAA AVAFQAVKAD QENTPGPLAT PSPSVSSATP TPAQTDGSAR TLTLTLGESG TLDLGNRQWP LKVIPDHGPT IAFDGIPKRA VNGALEIGFT ARDDYGLLEA RAEIVPVDAD PQASPLYPLP EFKLDLPSRN ARDIKSISSR NLTEHPLAGK KVRITLIAKD GLGQEGRSPA HEMILPGRNF SEPLAAAVAE QRQVFALDTR QMPRALALNE ALTLRADETI PQLSHYLLIS SAHARMQMAS TTEALKDTAD YLWEIALGID DGDVSLAERK LSEAQQKLAE ALERNAPDAE IKKLMDEVRQ AMQDYMKALA ERQQPQNGQQ NQQSAQKMLT QRDLENMMNQ IENLARSGNK DAARQMLQEM QRMMNNLQAG RPQRSNPQQQ QQTSEARKQI DKLGQIMQDQ QKLMDQTFKL DQELQSRSQM GDDLQPEDGD GMSQDNPQAE PTPDGDQNQQ QGQNPDQNQN KDSAGKNPSS PDQMTAEQLR EALKQLRQQQ DALGKQLKGV QDGLGKLGIK PGENFGQAGR EMQGAGEALG KSQGDRAVQG QGRALEALRQ GARDMMNQMM QAMQQGQGQG QGQGQGMAEG NQGGRDPLGR PRSTTGPDFG ERVKVPDEID VQRAREILDA IRNKLGNNAS PEVERRYLER LLDM
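The sequences above are printed in blocks of 10 characters. A minimal sketch of how such a record could be processed: remove the block spacing, compute GC content, and translate codons. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for elongation; the table's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. All function names are hypothetical, and only the first 30 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino-acid assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGACAGGAA ACCGGAGACT GCCCATGACC")
print(translate(prefix))  # MTGNRRLPMT
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the reported 62% GC content.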