Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4029 |
Symbol | |
ID | 5706433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4582182 |
End bp | 4583126 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641273454 |
Product | homoserine kinase |
Protein accession | YP_001538810 |
Protein GI | 159039557 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00116733 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCGACCA ACTTCATGCC CGACCCGGTC CGGGTGCGAG TCGCCGCCAC CAGCGCCAAC CTGGGGCCTG GCTTCGACGC CGTCGGGCTT GCCCTTGGAC TGTACGACGA CATCAGCGCC GAGGTCACCT CCGGCGGCGT CACGGTGCGA ATCGCCGGCG AGGGCGCCGG TGACCTGCCC GGCGACGAGC GGCATCTCGT GGTGCGTGCG ATGCGGGCGG CCTTTGACGA GTGCGGTGGC CAACCGGCGG GGCTGGCGGT GGAGTGCGTC AACCGGATTC CCCAGGCTCG CGGCCTTGGC TCGTCGTCGG CGGCGATCGT GGCCGGAGTG CTGTTGGCCC GGGCGTTGGT GATGGACGGA GAACGGCGAC TGGACGACGC GGCCGTGCTT CGGCTCGCGG CCCGCCTCGA GGGCCACCCG GACAATGTTG CGCCCTGCCT GCTGGGCGGC TTCACCATCG CGTGGACTGA GTCGGAAGGT GCCAAGGCGG TGTCCCTGCC GGTTGCCGCC GGGGTCCGGC CGACGGTGTT CGTGCCGACG GGGCGCGGGC TCACCGCCAC CGCGCGGGCC GCGCTGCCGG CCACCGTGCC GCACCTGGAC GCGGCCTCCA ACGCGGGCAG GGCCGCCCTG CTGGTCCACG CGCTCAGCGT GGCGCCGGAG CTGTTGCTGC CGGCCACGGC CGACCGGTTG CACCAGGACT ACCGGGCCGA GACCATGCCA GCGACGGCCG CGCTGGTTGC CGCACTACGC GCGGCGGATG TGCCAGCGGT ACTGTCCGGG GCGGGCCCCA GCGTGCTGGC GCTGCGCGAG CCGCCCGCCG ATCTGGCGGC AGGGCCGGAC TGGCAGGTGT GGTCGTTGCC GGTAGAGGTG CGCGGTGCCC GGGTCGGGCG GGGTAGACTA GGACACGCGG AACGGGATCC TGTTGCCGCA GGTCGGAAGA GTTGA
|
Protein sequence | MSTNFMPDPV RVRVAATSAN LGPGFDAVGL ALGLYDDISA EVTSGGVTVR IAGEGAGDLP GDERHLVVRA MRAAFDECGG QPAGLAVECV NRIPQARGLG SSSAAIVAGV LLARALVMDG ERRLDDAAVL RLAARLEGHP DNVAPCLLGG FTIAWTESEG AKAVSLPVAA GVRPTVFVPT GRGLTATARA ALPATVPHLD AASNAGRAAL LVHALSVAPE LLLPATADRL HQDYRAETMP ATAALVAALR AADVPAVLSG AGPSVLALRE PPADLAAGPD WQVWSLPVEV RGARVGRGRL GHAERDPVAA GRKS
|
| |