Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3151 |
Symbol | |
ID | 4600136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3352466 |
End bp | 3353395 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639777757 |
Product | heat shock protein HtpX |
Protein accession | YP_924340 |
Protein GI | 119717375 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.184609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACGCA CGCGTTTCGT CGGCGACGCC GGGCTGACCG CCCGGATGAC GCTGGTGATG TTCCTGCTCG GCGCCCTGTT CGTCGGGCTG ATCGCGATCG TCGGGACGAT CTTTGCGAGC TCCTTCGGCA TGGGAGCCGG GGTGCTCATC GGCGTCGTCG GTCTCGTGAT CGTCTGGTTC CAGTGGTACA AGTCCGACAC CGTCGCGATG AAGGCGATGC GGGCCCGCGA GGTCAGCCGC GAGCAGGCCC CGGAACTGCA CGACATGATC GAGCGGCTCT GCGCGCTCGC CGACATGCCC AAGCCTCGCG TCGGCGTCGC CGACACCGAT CTGCCCAACG CGTTCGCCAC CGGCCGGTCC CCGCAGCGCT CGGTCGTGGT GGTCACCACC GGCATCCTGC GCCGGCTCTC CGCCGAGGAG CTCGAGGGCG TGCTCGCCCA CGAGCTCTCG CACGTCGCCC ACCGCGACGT CCTGGTGATG ACGGCCGCCT CGAGCGCCGG CATCGTCGCG GGCATGCTCA CCCGCGGCTC GCAGTACGGC GCGTTCTTCG GTGGCGGCCG CCGCGACAAC AACAGCGGCG GGCTCCCGGT GTGGCTGGTC GTGCTGGTGG TCAGCCTGGT GACGTACGCC GTCAGCTTCC TGCTACTCAA GCTGCTCTCG CGCTACCGCG AGCTGTCGGC GGACCGGGCC GGCGCCTACC TCACGATGAA GCCGCAGGCC CTGGCCTCCG CGCTGCAGAA GATCACCGGC GAGATCAACC AGATCCCGCA GCGCGACCTG CGCCAGGCCA GCGCGATGAA CGCGTTCTTC TTCGCCCCCG CGATCCAGGG CGTCTCGCTG CGCACCCTGA CCTCGACCCA CCCGACCCTC GAGCAGCGGC TCGAGCAGCT CGCCAAGATC CAGGCCGAGC TCGGCCGCCC GGCGGCCTGA
|
Protein sequence | MARTRFVGDA GLTARMTLVM FLLGALFVGL IAIVGTIFAS SFGMGAGVLI GVVGLVIVWF QWYKSDTVAM KAMRAREVSR EQAPELHDMI ERLCALADMP KPRVGVADTD LPNAFATGRS PQRSVVVVTT GILRRLSAEE LEGVLAHELS HVAHRDVLVM TAASSAGIVA GMLTRGSQYG AFFGGGRRDN NSGGLPVWLV VLVVSLVTYA VSFLLLKLLS RYRELSADRA GAYLTMKPQA LASALQKITG EINQIPQRDL RQASAMNAFF FAPAIQGVSL RTLTSTHPTL EQRLEQLAKI QAELGRPAA
|
| |