Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1495 |
Symbol | |
ID | 4596311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1584629 |
End bp | 1585900 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639776093 |
Product | N-isopropylammelide isopropylaminohydrolase |
Protein accession | YP_922696 |
Protein GI | 119715731 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.55402 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCCACTC ACCAGATCGT CGTCGCGGAC GCGCGACTCC GCCATCCACC GGGCGGCCGT GACGGTCGCT GGCAGGTCGG GATCGACGAC GGCCGGATCA CGGCGCTCGC GCAGGACCCG CTGACCGGCA CCGACGAGAT CGACGCCGCC GGCGCGCTGG TCACGGAGAG CTTCGTCAAC GGTCACATGC ACCTGGACAA GGTGCACACC CTCGACCGGA TCGGCGACGC CGCGCTCACG GCGTACACCT CCGGCGACAT GGGCTCGGCG ATGACCTCCA TCGAGCTCGC CTCGGCGGTC AAGGCCGGCT ACGACCGGAG CTGGATCGAG CCCAACGCGC GCACGGCCGT CCTCGAGGCC GTGCGGTACG GCGTCCGCCA CGTGCTGGCG TTCGCGGACG TGGACCCCAA GGCGCGACTG GAGGGCGTGA TCCCGCTGCT GGCGCTGCGC GAGGAGTTCC GCGGCGTCGT CGACCTCCAG GTCGTCGCCT TCCCGCAGGA CGGACTCCTC CGCGACCCGG GTGCCGAGGA GCTGGTGCGC GAGGCCGTCG AGCTCGGCGC CGACGTGGTC GGCGGCATCC CCTGGATCGA GCACACCGAC GCCGATGCCC GGGAGCACGT GCAGCGGATG TGCGCCCTCG CCGCGACCCA CGACCGCCGG GTCGCGATGC TCGTCGACGA CGCCGGCGAC GCCGGCCTGC GCACCACCGA GATGCTCGCC GTCGCGATGA TCGAGCACGG CCTGGTCGGC CGCGGCGTCG CCAACCACGC CCGGGCCGTC GGCCTCTACG CGCGCCCGTC CCTCGAGCGG CTCGCCGGGC TCGCCCGCCG CGCCGGTCTC GGCTTCGTCA GCGACCCGCA CACCGGGCCG TTGCACCTGC CGGTCCGTGA GCTGACGGCG ATGGGCGTCC CGGTCGCCCT CGGCCAGGAC GACATCGAGG ACGCCTACTA CCCGTTCGGT CGGCACAACA TGGGCGAGAT CGCGTTCCTC GCGGCGCACG CCCTCGAGGC GGTCGACAGC GCCGGCATCG ACCTGGTGTA CGACGGCGTG ACCACCACCG CGGCCCGGGT CCTCGGCGTG ATCGGGCACC GGCTCGAGGT GGGCGGCAAC GCGGACCTGG TCGTCCACCA CCACCCGACG CTGCGCGAGG TCGTCGGCCA CCACGCGGCC CCGGCGTACG TCCTCGCCTC GGGCCGGTTG GTCGCCAGCT CGAGCGCCAC CACCACGTTC CACCTGGACA ACCTGGACAA CCTGGAAGGG AACCAGCCAT GA
|
Protein sequence | MPTHQIVVAD ARLRHPPGGR DGRWQVGIDD GRITALAQDP LTGTDEIDAA GALVTESFVN GHMHLDKVHT LDRIGDAALT AYTSGDMGSA MTSIELASAV KAGYDRSWIE PNARTAVLEA VRYGVRHVLA FADVDPKARL EGVIPLLALR EEFRGVVDLQ VVAFPQDGLL RDPGAEELVR EAVELGADVV GGIPWIEHTD ADAREHVQRM CALAATHDRR VAMLVDDAGD AGLRTTEMLA VAMIEHGLVG RGVANHARAV GLYARPSLER LAGLARRAGL GFVSDPHTGP LHLPVRELTA MGVPVALGQD DIEDAYYPFG RHNMGEIAFL AAHALEAVDS AGIDLVYDGV TTTAARVLGV IGHRLEVGGN ADLVVHHHPT LREVVGHHAA PAYVLASGRL VASSSATTTF HLDNLDNLEG NQP
|
| |