Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_6014 |
Symbol | |
ID | 7380841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 1017460 |
End bp | 1019673 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643649516 |
Product | exopolysaccharide polymerization/transport protein |
Protein accession | YP_002547747 |
Protein GI | 222106956 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.807866 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGG ACATCGACAT TTTTCAAATC CCCGGGATCT TGCGACGGCG CTGGTATTAC CTGGCGTTTT TTGCCGCGCT GTTTGCCGGT CTGGCGCTGC TCTATGCGCT CAGCCTGAAG CCTGTCTATG TATCGTCCAC CCAGATCCTG CTCGATCCGC GCGGCCTGTC TGCCACCAGT AGCGACAGCC GACAGCCGAC AGTCGCCGTA CAAAGCGACC CGGCCAGCCT CGACAGCCAG ATCTATGTCG TCCTGTCCAG CGCCGTGCTG GGGGAAGTGG TCAACCGGCT CGACCTGACG AAAGACTCCT ATCTCTATGC GGGAAAGCCA AGCAGCGCCG TATCCCCCGC TGAGGTCATG GCTGCCACCA TCGGCGGGCT GGTCCGGCAT GTGAAAGTCG AGCGCGAAGG CCAGTCCTTC ATCATGTCGA TCACGGTCGA ACACCGCATC GCCAAAACGG CAGCCGACAT TGCCAATATG ATTGCCACCG TCTACCTGAA ACAGGTGGAC GAAGCCCGCT CCGACGCAGC GCGCCGGGCA AGCGCCGCCT TCCAGGCGCA GGCCAGTGAA TTGCGCGATC GGGTGCTGAA GGCCGAAAGG GCGGTCGAGG AATTCCGATC CGCCAACGGT CTGGCCAGCA CCGGCGTCAC CGGACTGGTG ATCGACCAGC AACTAGCAGG CCTGAACCAG CAGTTGATCG CAGCACGCGG CGCGGAAGAA CAGCAGCAGG CCATTTACCA GCAGACCCGC AATCTCACGG TCGCTGCCGT TGAAAACGGC AATATCCCAG AAGCGGTACA ATCCACTACT GTCGGGCTGC TGCGCGACCG CTATGTCCAG CTACAGGACC GCCAGGCCGA AGCGTCCGCC AATCTCGGCG GCAACCATCC GCAACTGAAG GCGATCAATT CGCAGGTGGC CAGCATGCGC CAGGCCATCC AGCAGGAGCT GGACCGGGTG CGCCAGTCGA TGAAGCTCAA CTATGACCGG GCGGTTGCCA ACCGCAAGGC GCTGGAAACC CAGCTGCAAA GCCTGACGAA AACCAGTTTC GACAGCGGGG CGCGGCAGAT CACCCTGCGC CAGCTGGAAA GCGAGGCGGA AGCCATCCGC ACCATTTACA AGGCCTTCCT CAACCGCGCC GAGGAGCTGA GCCAGGAACA GACGATCTCC ATCAACAATT CCCGGGTTAT CACCGAGGCA GTGGCGACAG CGAAATCGGT CACGACCCTC AAAGTGATGA TCCTTGCCGC CGCCATCCTG TTTGGTCTGG CCTTCGGCAG CACGCTGGCG GTGGTGCTGG AACTCCTGTC GCGCAAGGAT GTCGCGCCGC AAGCAGGCAT CGCCGCGCCT GCCACGGCGA CCTCATCGCC TCCTGGCAAA GGCGAACCAC CCCCAGCACC GCCCATTGCG TCAGCAAGAC ATATCGCTCT GATTGCCGAC GCTACGGAAC CTGAAAAACA GAAGTCCCGC AATCCGTTCA GCTTTATCAC CGCCTTTGGC CGTCGGCTGG TCTCGCCTCT TGTTCCCGCA TCCAACCCGG CAACCGCATC ACAACCGGCG GCAGGCGGCG CCTGGTCCCA TGCGGTCGCC AGCACAGCCG GTTTTCTGAT TGAATGCGGT GAAGGCTATG CTGATCTGAC CGTTCTGTTT GTTGCAGCGG GCAGGCCGGT GGCCAGCGCC TTTATTGGCG ATGTGGCACA GAAGCTGGCT GATCGGGACC GCGGCGTGCT GCTGGCCAAT GGTGCCATGC TGGACCACCG CTTAGCGATC CGCTCGCACA GAAAAACCAA TGCCCGGCCA AGCCTTGCCC AGGCATTGCA GCAGCCCGAT CTCGAAGACG CACCTCTCTC GCACATCCTG CGCTACGAGC GCATCGCCCT GCCCCGCAAC CAGCCAGCGC GGCCAGCAGC GGGAGCATCG GCCTCCTCCG GTCGCCCGAC CTATTCGCGA TTTGTCGAGC AAAGCCTTCA GGCGGAAACC GATTTCACCT TGATTAATGC CTGCGGCGCT TGGATCAATG CCCGCGGCGC CGGAATCAAT GCTAGCGGTG CTGGTGGCGA GCAGCATCTT ACTGCCCTTG CCGCCGAGGC CGATGTCATT CTCGTCTTGA CCACGGCGCA AGACGAGGCC GCCGCCCTGG ATGAACTGCT GATGCGGCTG GGCGAAGACG CAGAACGGGT CGTGGGTCGT ATCGTGCTCG AGACCGCCGG ATGA
|
Protein sequence | MKMDIDIFQI PGILRRRWYY LAFFAALFAG LALLYALSLK PVYVSSTQIL LDPRGLSATS SDSRQPTVAV QSDPASLDSQ IYVVLSSAVL GEVVNRLDLT KDSYLYAGKP SSAVSPAEVM AATIGGLVRH VKVEREGQSF IMSITVEHRI AKTAADIANM IATVYLKQVD EARSDAARRA SAAFQAQASE LRDRVLKAER AVEEFRSANG LASTGVTGLV IDQQLAGLNQ QLIAARGAEE QQQAIYQQTR NLTVAAVENG NIPEAVQSTT VGLLRDRYVQ LQDRQAEASA NLGGNHPQLK AINSQVASMR QAIQQELDRV RQSMKLNYDR AVANRKALET QLQSLTKTSF DSGARQITLR QLESEAEAIR TIYKAFLNRA EELSQEQTIS INNSRVITEA VATAKSVTTL KVMILAAAIL FGLAFGSTLA VVLELLSRKD VAPQAGIAAP ATATSSPPGK GEPPPAPPIA SARHIALIAD ATEPEKQKSR NPFSFITAFG RRLVSPLVPA SNPATASQPA AGGAWSHAVA STAGFLIECG EGYADLTVLF VAAGRPVASA FIGDVAQKLA DRDRGVLLAN GAMLDHRLAI RSHRKTNARP SLAQALQQPD LEDAPLSHIL RYERIALPRN QPARPAAGAS ASSGRPTYSR FVEQSLQAET DFTLINACGA WINARGAGIN ASGAGGEQHL TALAAEADVI LVLTTAQDEA AALDELLMRL GEDAERVVGR IVLETAG
|
| |