Gene Information |
Locus tag | Avi_2378 |
Symbol | |
ID | 7386019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 1946201 |
End bp | 1947682 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643651512 |
Product | hypothetical protein |
Protein accession | YP_002549701 |
Protein GI | 222148744 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
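
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines of Python. This is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1946201
END_BP = 1947682
STOP_CODON_BP = 3  # assumption: the gene length includes one stop codon


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    return (gene_len - STOP_CODON_BP) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1482, matching "Gene Length"
print(protein_length(length_bp))  # 493, matching "Protein Length"
```

The same arithmetic would not hold for a spliced gene or a pseudogene, which is why the "Is gene spliced" and "Is pseudo gene" fields matter when cross-checking lengths.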
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.946699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCAAA CATCTTTTCA TCACCTGATT TCTCCCTCTT CCATGGCGCT GGTGGACCTT GATGCGGCCA ACTCGGGCAT CGATAGTTAT GGCCTGATGC GCAAGGCCGG TTCTGCCGTG GCGGCAGCGG CACTCCGGCT TTTCCCCCAG GCGCTGAGGG TGGCGGTGCT GTGTGGGCCG GGAAACAATG GCGGTGACGG CTATGTTGCG GCTGAAGCGC TGCGTCAATC CGGGGTTTTG GTGCAGGTCT TTTACCTGGG CGAGCCGGAA AAGTTGAATG GAGATGCCGC TCTGGCTTTT GCCGACTACC AGGGCAAGGC CGAACCCCTA GCTCTCTATG ATCCTCAGCA GGGGGATCTG GTGGTAGACG CTTTGTTCGG AGCAGGCCTC GCGCGGTACT TGCCTTTGCC GGTGACAGAA CTGATCGAGC GGGTCAATCG GGTCGGTATC GCCGTTGTCG CGGTCGATCT GCCCTCCGGT ATAGACGGCC GCACGGGCCA GCCCCGCCCG GTGGCGTTTC AGGCAGCCCA TACGGTGACA TTCATGGCAA GAAAGCCGGG ACATGTGCTG CTACCTGGAC GTTCGCTCTG CGGCACGGTG GAGATCTACG ATATCGGCAT CCCTTGTCGC ACCATTGAAC AACATCGAGG AGACGTTGCT GTCAATCATC CCGATTTATG GGCGCATCTC CTGCCAAGGA TCTCCGGCGC TAGCCATAAA TTCACCCGCG GCCATCTTAC GGTGTTTTCC GGGCGCAGCA GCGCGACAGG TGCCGCGCGG CTGTCCGCTA TGGCAGGATT GAAGGCCGGG GCCGGGCTGG TGACACTGGC CTCACCGGCC AGTGCCGTTC TTGTCAATGC CGCGCAGACC ACTGCCGTGA TGGTGAAGGC CATCAATGAT CTTGATGACC TTGAGGACTA TCTGACTGAC CAGCGGATGA GCGCCTTCGT GCTCGGACCC GGATTTGGGA TTGGCGAAAA GGCGCGGGAA TTCACCCTTT CGCTGTCCAA ACGCAGGCTG ATTCTCGATG CCGACGGTAT TTCTTCTTTT AGGGATCAAC CAGACGAGCT TTTCGACGCA TTTTCCGGCA ATGAAACCCG CTTGGTTCTG ACGCCGCATG AGGGCGAGTT TGCTCGCCTG TTTGCCGATA TTGCCGGTGA GAAGACGTTG GGCAAAGTTG AAAAGGCCCA GGCTGCCGCC AGAAAAGCCA ATGCGGCGGT GGTCTACAAA GGTGCAGACA CAGTGATTGC CGCTCCCGAT GGCCGGGCGC TGATCAACGA AAATGCGCCG CCGTGGCTTG CCACAGCCGG TTCCGGCGAT GTGTTGGCCG GGATCATCGG CGGACTACTA GCGCAAGGCG TTCCGGCCTT TGAGGCTGCG GCAGCAGGTG TCTGGCTGCA TGCTGAAACG GGCGCAAGGC TCGGCGAGGG ATTGACAGCA GAGGATCTGG CTGCGGCTGT GAAACCCTTT CGCCAAGGCT AA
|
Protein sequence | MMQTSFHHLI SPSSMALVDL DAANSGIDSY GLMRKAGSAV AAAALRLFPQ ALRVAVLCGP GNNGGDGYVA AEALRQSGVL VQVFYLGEPE KLNGDAALAF ADYQGKAEPL ALYDPQQGDL VVDALFGAGL ARYLPLPVTE LIERVNRVGI AVVAVDLPSG IDGRTGQPRP VAFQAAHTVT FMARKPGHVL LPGRSLCGTV EIYDIGIPCR TIEQHRGDVA VNHPDLWAHL LPRISGASHK FTRGHLTVFS GRSSATGAAR LSAMAGLKAG AGLVTLASPA SAVLVNAAQT TAVMVKAIND LDDLEDYLTD QRMSAFVLGP GFGIGEKARE FTLSLSKRRL ILDADGISSF RDQPDELFDA FSGNETRLVL TPHEGEFARL FADIAGEKTL GKVEKAQAAA RKANAAVVYK GADTVIAAPD GRALINENAP PWLATAGSGD VLAGIIGGLL AQGVPAFEAA AAGVWLHAET GARLGEGLTA EDLAAAVKPF RQG