Gene Information |
Locus tag | Noca_1697 |
Symbol | |
ID | 4599736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 1803313 |
End bp | 1804188 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639776296 |
Product | histidinol-phosphate phosphatase |
Protein accession | YP_922897 |
Protein GI | 119715932 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
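Note: the coordinate and length fields above are mutually consistent. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon; the function names are hypothetical and not part of the record.

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Noca_1697.
gene_len = cds_length_bp(1803313, 1804188)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 876 291
```

Both results match the Gene Length (876 bp) and Protein Length (291 aa) fields.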
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.893079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGATCT CCACGGACTA CACGGACGAC CTGCGGCTCG CCCACGTCCT CGCCGACGAT GCCGACTCCC TGACCGAGGC CCGCTACAAG GCCCTCGACC TGCACGTGAT GAGCAAGCCC GACCTGACGC CGGTCACCGA CGCCGACCAG GCGGTCGAGG AGAGCATCCG GCGCACGCTG TCCCGGGTGC GCTCTCGCGA CGCCGTCACG GGCGAGGAGC AGGGCTCCAC CGGCCACAGC CAGCGGCGCT GGGTCATCGA CCCGATCGAC GGCACCAAGA ACTTCGTCCG CGGCGTCCCG GTGTGGGCCA CGCTGATCTC CCTGGTCGTC GACGACCAGG TGGTGGTCGG CGTCGTCTCC GCTCCGCTGC TCCAGCGCCG CTGGTGGGCG TCGATCGGCA GCGGCGCATG GACCGGCCGG TCGCTGCTCA AGGCGACCCG CTGCCAGGTG TCCGACGTCC GCCGCCTCGA GGACGCCTCG CTCTCCTACG CCTCCTTGCA CGGGTGGGAC GAGCGCGACC GGCTCGACGA CTTCCTGTCC CTGATGCGCC GCTGCTGGCG CACCCGGGCG TACGGCGACT TCTGGTCCTA CATGCTGCTC GCCGAGGGGG CGGTCGACAT CGCGGCCGAG CCCGAGCTCG CCATCTACGA CATGGCCGCG CTGGCGGTGA TCGTCAGCGA GGCGGGCGGC CGGTTCACCT CCCTGGACGG CACCGACGGA CCGTTCGGCG GCAACGCGCT CGCCACGAAC GGCCACCTCC ACGAGGCCGC CCTCTCGTTC CTCGCGGCGC TCCCCGACGA CGAGGACGAC CCCGACTCGC GGCGCACCGG CCCCGGCTCG GTCTCCGACC TGCGGCTGCA CCGCCCCCGG GACTGA
|
Protein sequence | MPISTDYTDD LRLAHVLADD ADSLTEARYK ALDLHVMSKP DLTPVTDADQ AVEESIRRTL SRVRSRDAVT GEEQGSTGHS QRRWVIDPID GTKNFVRGVP VWATLISLVV DDQVVVGVVS APLLQRRWWA SIGSGAWTGR SLLKATRCQV SDVRRLEDAS LSYASLHGWD ERDRLDDFLS LMRRCWRTRA YGDFWSYMLL AEGAVDIAAE PELAIYDMAA LAVIVSEAGG RFTSLDGTDG PFGGNALATN GHLHEAALSF LAALPDDEDD PDSRRTGPGS VSDLRLHRPR D
|
| |
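Note: the protein sequence can be reproduced from the gene sequence with translation table 11, in which an alternative start codon such as GTG is read as methionine at the first position. The sketch below uses only the Python standard library and assumes the gene sequence is already given in coding orientation (it begins with GTG and ends with TGA, even though the gene lies on the minus strand). The helper names are hypothetical. Only the first 30 bases of the record are used here to keep the example short.

```python
from itertools import product

# Codon table built from the compact NCBI layout (base order T, C, A, G).
# Table 11 assigns the same amino acids as the standard code; it differs
# in which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}  # table 11 starts


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base groups and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is translated as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
prefix = "GTGCCGATCT CCACGGACTA CACGGACGAC"
print(translate_cds(prefix))  # MPISTDYTDD
print(round(gc_fraction(prefix), 2))
```

Passing the full Gene sequence field to `translate_cds` and `gc_fraction` would allow the same comparison against the Protein sequence and GC content fields.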