Gene Information |
Locus tag | Avi_3722 |
Symbol | kpsF |
ID | 7388192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 3085833 |
End bp | 3086828 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643652507 |
Product | capsule expression protein |
Protein accession | YP_002550689 |
Protein GI | 222149732 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
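The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the record does not state either convention explicitly. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3085833
END_BP = 3086828
GENE_LENGTH_BP = 996
PROTEIN_LENGTH_AA = 331


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 3086828 - 3085833 + 1 = 996 bp, and 996 / 3 - 1 = 331 aa.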
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGC TGGCTGTAAA ACTCGTGGAA GCAAGCGCCA TCAAGGCTGC ATTGCGTGTC GTCGCTACCG AACAAAGCGG GCTGGCGGCC CTGGAAGAGG CTTTGGCAGG GTATCTTGCC GGGCCGTTTT GCAACGCAAT CGATGTTATC GGCAAAAGCT CGGGCCGGGT GATCGTCTCT GGCGTCGGCA AGAGCGGCCA CATCGGCGGC AAGATCGCGG CAACGTTCGC CTCGACAGGC ACACCGGCTT TCTTCATCCA TCCGGCGGAA GCCAATCATG GCGACCTCGG CATGATTGCC CGTGACGATG TGGTGATTGC CCTGTCCTGG GGCGGCGAAA GCACCGAGCT GAACGGAATC CTGTCCTTCA CTCGCCGGTT TTCCATTCCA CTGATAGCCA TTACCGCCGG TGAGCAATCC ACGTTGGCAC GTGAAGCCGA TATCGTGCTT TTGATGCCCA AGGTGCAGGA AGCCTGCCCG CATGGCTTGG CGCCGACCAC CTCCACCATG ATGCAGATGG CGCTGGGCGA TGCGCTGGCG CTGGCGCTGC TGGAAGCTCG TGGCTTCGGG CCGAATGATT TCAAGACCTT CCATCCGGGC GGCAAGCTGG GCGCGATGCT GACTCATGTC GGCGACATGA TGCATATCGG CGAGGACGTG CCGCTGGTGC CGGAAGGCAC ATCGGTGCCG GAAGCGATTA TCATGCTATC GCAGAAGCGT TTTGGCTGCG TCGGCGTCAC CGACAGCGCT AACCGGTTGG TCGGTATCAT CACCGATGGC GATATTGCCC GTAACCTCAA CCGCAATCTC GGTGAGCGGA TGGTGGAAGA GGTGATGACC CGTCACCCAA AGACGGTTCA CACCGAAACA CTTGCGACCA CCGCCATGGC GATCCTCAAC CAGCACAATA TTTCGGCGCT GTTTGTGACC GATGAAGACG GTGTGCCAAA CGGCATCATC CACTTCCATG ATTTGTTGCG GATTGGTGTG GCTTAA
|
Protein sequence | MNKLAVKLVE ASAIKAALRV VATEQSGLAA LEEALAGYLA GPFCNAIDVI GKSSGRVIVS GVGKSGHIGG KIAATFASTG TPAFFIHPAE ANHGDLGMIA RDDVVIALSW GGESTELNGI LSFTRRFSIP LIAITAGEQS TLAREADIVL LMPKVQEACP HGLAPTTSTM MQMALGDALA LALLEARGFG PNDFKTFHPG GKLGAMLTHV GDMMHIGEDV PLVPEGTSVP EAIIMLSQKR FGCVGVTDSA NRLVGIITDG DIARNLNRNL GERMVEEVMT RHPKTVHTET LATTAMAILN QHNISALFVT DEDGVPNGII HFHDLLRIGV A
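The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the gene. It assumes GC content is defined as (G + C) / length, which the record does not state. The codon table is written out by hand and uses the standard amino acid assignments, which translation table 11 shares with the standard code; the two differ in which codons may act as start codons, and this sketch does not handle that case because this gene begins with ATG. The helper names are illustrative, and only the first 30 bases of the gene are used as sample input.

```python
# Standard codon assignments in NCBI order (T, C, A, G at each position).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases (assumed definition of GC content)."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
sample = clean("ATGAACAAGC TGGCTGTAAA ACTCGTGGAA")
sample_protein = translate(sample)
sample_gc = gc_fraction(sample)
print(sample_protein)
```

The sample translates to the first ten residues of the protein sequence listed above. Passing the full gene sequence through `clean`, `gc_fraction`, and `translate` would allow the GC content and protein sequence fields to be compared with the record in the same way.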
|
| |