Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5930 |
Symbol | exoH |
ID | 7381020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 947333 |
End bp | 948409 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643649448 |
Product | succinoglycan biosynthesis protein |
Protein accession | YP_002547679 |
Protein GI | 222106888 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGATCG TCGGCATTGT CTTCGTACAT GTCCCCTTCG ATCCGCAGTC GAACGCCTAT CTGGGCCAAT ACGGCGCCTT TGATTGGCTG CGCGCCTTTC TCGGTGACAT GCTGTTTCGC ATTGGCGTTC CCTGCCTGAG TGCGATTTCC GGCTTGCTGA TGTTTCGCAA AGGGCTGGAC GGTTTCGATT ACAGGAAAGT GCTGCGCTCA AAAAGTGTGA GCGTATTGCT GCCGTTCCTG CTCTGGAACA GCATCGTTTT CCTGTATGTG CTGACGACGC AATCTCTGGG CTATGGCGAA GGATATTTCC CAAGCGTACT GACCGCGTCT TATCGGGAAC TCGTTACCCT GGTGCTGGCG ACGGAAGATT GGCCAATCAA TCTGCCGCTC TATTTCCTGC GTGATCTGCT GCTCTGCTTC CTCCTATCGC CGCTGATTGG CTTCCTGGTG AAACGCCAAC CCGCACTGAC GCTTGGAGTT CTGTTGCTTT ATGTCGTCCT ACCGGTGCCC AATATGATCT TCCTGAAAAA ATCGATCATT CTCGGGTTTT CGGCTGGTGC GGCGATTGCG ATACACAATG TCGATATCCG CAAGCTCGAC CGTTATGCCT TGCCGATTGT GGCAGCCGTG CTGGTCTCGG CCATGGCGGT CTTCGTCTGC ATGTATCAAG CTGGCTTTAA CGAGCACCCA CTCTGGCTTG ATGTGATATA TGGTCTGATC ACGGTGATTG GCGGCCTGGG AGCCTGGGAA ATGACCCGGC TGCTCAAGAA TACGCGCCTT GGCAAAACAC TGGTCGCCGC ACCCGGCGGA CTTAGTTTCT GGATTTTCTG TGCGCATTAT CCGATCTTGA TGACCGCATG GATGATCTGG CAAAAATTCG ATATAGGGCC TTACCCCCTC TTTTACGTCC TTGCGCTACC CGTCACCTTC TGCCTGCTGC TTGCCTCCCA TTTCCTGGCG ATCCGGCTGA CGCCGGGATT TTACGGGGTC ATGACCGGCC AGCGCAATGC AAGCGGTAGA GACAAGAACA AGAACGTGGC AGCCGGGCGG GCGCTTGCCT TCAGCAAAGG AGACTGA
|
Protein sequence | MLIVGIVFVH VPFDPQSNAY LGQYGAFDWL RAFLGDMLFR IGVPCLSAIS GLLMFRKGLD GFDYRKVLRS KSVSVLLPFL LWNSIVFLYV LTTQSLGYGE GYFPSVLTAS YRELVTLVLA TEDWPINLPL YFLRDLLLCF LLSPLIGFLV KRQPALTLGV LLLYVVLPVP NMIFLKKSII LGFSAGAAIA IHNVDIRKLD RYALPIVAAV LVSAMAVFVC MYQAGFNEHP LWLDVIYGLI TVIGGLGAWE MTRLLKNTRL GKTLVAAPGG LSFWIFCAHY PILMTAWMIW QKFDIGPYPL FYVLALPVTF CLLLASHFLA IRLTPGFYGV MTGQRNASGR DKNKNVAAGR ALAFSKGD
|
| |