Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1942 |
Symbol | thrB |
ID | 5136666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2072106 |
End bp | 2073062 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640533399 |
Product | homoserine kinase |
Protein accession | YP_001217866 |
Protein GI | 147673067 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.17619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTTG TGGTGTACGC ACCCGCTTCC ATCGGTAATG TCAGTGTCGG ATTTGATGTG TTGGGGGCCG CGGTGTCCCC GATTGATGGC ACGTTACTCG GCGATCGAGT GAAAGTGGAA GCTGGCGCAG AGGCTTTCAC ACTGAAAACG GCGGGGCGAT TTGTCGATAA GCTTCCCGCT AATCCGCAAG AAAACATTGT GTATGACTGC TGGCAAGTAT TCGCGCGTGA GCTAGAGAAA AAGTCGGTGG TACTGAAGCC GCTGACCATG ACGCTAGAAA AAAATATGCC GATTGGTTCA GGATTGGGCT CCAGTGCTTG CTCGATTGTC GCGGCTCTGG ATGCGTTGAA TCAGTTTCAC GCCAGTCCGC TTGATGAAAC TGAGCTTCTG GCTCTGATGG GGGAAATGGA AGGCAAGATC TCTGGCAGTA TTCACTATGA CAACGTTGCT CCCTGCTATT TAGGTGGCGT GCAGTTGATG CTCGAAGAAC TCGGTATCAT TAGTCAATCC GTGCCGAGTT TTGATGACTG GTATTGGGTG ATGGCCTATC CGGGGATCAA GGTGTCCACG GCGGAAGCGC GTGCCATTTT GCCTGCGCAA TATCGCCGCC AAGATATTGT GGCGCATGGC CGCTATCTGG CAGGATTTAT TCACGCTTGC CATACTCAGC AGCCTGAATT AGCGGCAAAA ATGATCAAAG ACGTGATTGC CGAACCCTAT CGTGAGAAAC TGCTGCCGGG TTTTGCCAAA GCGCGCAGCT ACGCCGCTGC GGCTGGCGCA CTGGCAACGG GCATTTCGGG CAGTGGTCCG ACTTTGTTTA GCGTGTGCAA AGAACAAGCA GTGGCTGAAC GTGTGGCACG TTGGCTTGAG CAAAACTATG TGCAAAATGA AGAAGGATTC GTCCACATTT GTCGTCTAGA CAAGCAAGGT TCGAAAGTAA CAGGAAGTGA GCTATGA
|
Protein sequence | MSVVVYAPAS IGNVSVGFDV LGAAVSPIDG TLLGDRVKVE AGAEAFTLKT AGRFVDKLPA NPQENIVYDC WQVFARELEK KSVVLKPLTM TLEKNMPIGS GLGSSACSIV AALDALNQFH ASPLDETELL ALMGEMEGKI SGSIHYDNVA PCYLGGVQLM LEELGIISQS VPSFDDWYWV MAYPGIKVST AEARAILPAQ YRRQDIVAHG RYLAGFIHAC HTQQPELAAK MIKDVIAEPY REKLLPGFAK ARSYAAAAGA LATGISGSGP TLFSVCKEQA VAERVARWLE QNYVQNEEGF VHICRLDKQG SKVTGSEL
|
| |