Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3933 |
Symbol | |
ID | 4598068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4139472 |
End bp | 4140890 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 639778539 |
Product | hydrolase |
Protein accession | YP_925118 |
Protein GI | 119718153 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases [COG1606] ATP-utilizing enzymes of the PP-loop superfamily |
TIGRFAM ID | [TIGR00268] conserved hypothetical protein TIGR00268 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.158663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGCGG CGCCCCTGGT GGTCGGTTTC GACCTCGACA TGACGCTGAT CGACACCGTG CCGGGCTTCA GTGCGACGCT GCTGGCGCTC GGCGCGGAGC TCGGCGTCGA GTTCCCGGTC GAGGACCTGA TCACCCGTCT CGGACCGCCG CTCGACCTGC TGCTGGGGGA GCACCTCGCC GCGGACGCGG TGGCGCCCGC CGGCGACCGG TTCCGGGCGC TCTACCCCGA CCACGCCATC GCCCCGGTCC CGCTCCTCGC CGGCGCCGAG GACGCCCTGG CGGCTGTCCG CCGGCACCGT GGCCGGGTGC TGGTCGTGAC CGGGAAGTAC CCCGCCAACG CGCGCCTGCA CCTCGATCAC CTCGGCGTGG AGGTGGACCA CCTCGAGGGG TGGGTGTGGG GCGTCGGCAA GGCGGACGTG CTGCGCCGCG AGGGCGCGAC CATCTACGTC GGCGACCACG TCCACGACGT CGAGGGCGCC AAGGCCGCCG GCGCGCTGAG CGTCTCGGTG CTCACCGGCG GGTCGACGCG GGAGGAGCTG GTCGCGGCCG GCACCGACGT GCTCCTCGGC AGTCTCGCGG AGTTCCCGGA CTGGCTCGAG GAGCACCTGC TGCAGACCCG GCTCGACGCG CTCGCGGCCG ACCTGCGCGA GCGTGGCTCG GTGCTGGTCG CCTACAGCGG GGGCGCAGAC AGCGCGTTCC TCCTGGCCGC TGCCGTCCGC GCGCTGGGCG CCGACCGCGT CGCGGCCGCC ACCGGCTACT CGCACTCGCT GCCGCTGGCC GAGCGTGACC CGGCACGCGA CTTCGCGGCC GCGCTCGGCG TCGAGGTGCT CACCCCGGCC ACCCACGAGA TCGAGCGCGA GGGCTACCGG TCCAACGGGG CGGACCGCTG CTACTTCTGC AAGGCCGAGC TGCTCGACGT GCTCACCCCG CTCGCCGCCG GTCGCGGGCT GGCCCACGTC GCCACCGGCA CCAACGCCGA CGACGCCGTC GCGGGCTTCC GGCCCGGCAT CCGCGCAGCC GACGAGCGCG GCGCGATCGC GCCCCTGCGC GACGCCGGGC TGACCAAGGC CCAGGTCAGG GAGGCCTCCC GCCGCTGGGA CCTGCCGACC TGGGACAAGC CGGCGGCCGC CTGCCTGTCC TCGCGGGTCG CGTACGGCGT CGAGGTGACG CCGTACCGAC TGGGCCGGGT GGAGCGGGCC GAGACCGCGG CGCGGGCGCT GCTGGCGGCC GTCGGGCTGC GCAACCTGCG GGTCCGTGAC CTCGGCGAGC GCGCCTGCGT GGAGGTCGAC GCGGCCCTGC TGCCGCTGGC CGCCGACGTG GAGGCCCGGC TGCTGGACGC GGTGCGCGGG GCCGGGTTCG CGAGTGCCGA GGTGGACCGG CGCGGGTTCC GTTCGGGGTC GATGAACGAG GCGCTATAG
|
Protein sequence | MHAAPLVVGF DLDMTLIDTV PGFSATLLAL GAELGVEFPV EDLITRLGPP LDLLLGEHLA ADAVAPAGDR FRALYPDHAI APVPLLAGAE DALAAVRRHR GRVLVVTGKY PANARLHLDH LGVEVDHLEG WVWGVGKADV LRREGATIYV GDHVHDVEGA KAAGALSVSV LTGGSTREEL VAAGTDVLLG SLAEFPDWLE EHLLQTRLDA LAADLRERGS VLVAYSGGAD SAFLLAAAVR ALGADRVAAA TGYSHSLPLA ERDPARDFAA ALGVEVLTPA THEIEREGYR SNGADRCYFC KAELLDVLTP LAAGRGLAHV ATGTNADDAV AGFRPGIRAA DERGAIAPLR DAGLTKAQVR EASRRWDLPT WDKPAAACLS SRVAYGVEVT PYRLGRVERA ETAARALLAA VGLRNLRVRD LGERACVEVD AALLPLAADV EARLLDAVRG AGFASAEVDR RGFRSGSMNE AL
|
| |