Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1522 |
Symbol | |
ID | 4595673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1616182 |
End bp | 1617318 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639776120 |
Product | hypothetical protein |
Protein accession | YP_922723 |
Protein GI | 119715758 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1387] Histidinol phosphatase and related hydrolases of the PHP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.975012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTCC ACCGGGTGTG GACGAAAGAA AACTGGGCCA ACCAGCGGGA GAACCGACGC GAGAGAGAGG TCAGCGGCGT GCCGGACCAG ACAGCGGGCG ACGAGTACGG CGCCGGGCCG GTCGCCGCCC TGCGCCGGAT CGCGTTCCTG CTCGAGCGCG GCCGGGAGGA CACCTACAAG GTCAAGGCGT TCCGCGGTGC CGCTGCCGCG ATCCTGCCGC TGACGGCCGA GCAGGTCGCC GCGGCGGTCG AGGACGGCAG CCTGACGTCG CTGCCGGGTG TCGGTGCGAG CACGGCCCGG GTCATCGCCG ATGCCGTGCG CGGGGTGCTG CCGACCCGGC TGGCCGAGCT CGAGCGCGAG CACGGCGGTG ACCTGGCGAG CGGCGGCCAG GGGCTCCGCG CCGCCCTGCG CGGCGACCTG CACTCCCACT CCGACTGGTC CGACGGCGGC TCGCCGATCG AGGAGATGGC GTTCACGGCC ATCGAGCTCG GCCACGACTA CCTGGTGCTC ACCGACCACT CGCCGCGGCT GACCGTCGCG CACGGCCTCA GCGCCGAGCG GTTGACCCGC CAGCTGGGCG TGGTCGACGC GGTCAACCGG CACCTCTCCG GGGTCGACGA CTCCTTCACG CTGCTCAAGG GGATCGAGGT CGACATCCTC GACGACGGCT CGCTGGACCA GGACGACGAC CTGCTGGCGC AGCTCGACGT CCGGGTCGCG AGCGTGCACT CCAAGCTCAA GATGGAGCCG GCGGACATGA CCCGGCGGAT GATCGGCGCG ATCCGCAACC CGCGCACCAA CGTCCTCGGT CACTGCACCG GCCGGCTGGT GACCGGCAAC CGCGGCACCC GCCCGGGCTC CCGTTTCGAC GCGGGTGCGG TGTTCGAGGC CTGCGCGGAG CACGACGTCG CGGTCGAGAT CAACTCCCGG CCCGAGCGGC GGGACCCGCC GACGGCGCTG CTGGAGCTGG CCCGGGACGC CGGCTGCGTG TTCTCCATCG ACAGCGACGC CCACGCCCCC GGGCAGCTGG ACTTCCTGGT CTACGGCTGC GAGCGGGCCG AGGCGGCCGG CATCGACCCG GACCGGATCG TCAACACCTG GCCGCGGGAG CGGTTGCTGG CCTGGGCGAG GAAGTAG
|
Protein sequence | MAVHRVWTKE NWANQRENRR EREVSGVPDQ TAGDEYGAGP VAALRRIAFL LERGREDTYK VKAFRGAAAA ILPLTAEQVA AAVEDGSLTS LPGVGASTAR VIADAVRGVL PTRLAELERE HGGDLASGGQ GLRAALRGDL HSHSDWSDGG SPIEEMAFTA IELGHDYLVL TDHSPRLTVA HGLSAERLTR QLGVVDAVNR HLSGVDDSFT LLKGIEVDIL DDGSLDQDDD LLAQLDVRVA SVHSKLKMEP ADMTRRMIGA IRNPRTNVLG HCTGRLVTGN RGTRPGSRFD AGAVFEACAE HDVAVEINSR PERRDPPTAL LELARDAGCV FSIDSDAHAP GQLDFLVYGC ERAEAAGIDP DRIVNTWPRE RLLAWARK
|
| |