Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1942 |
Symbol | |
ID | 4599847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2072359 |
End bp | 2073519 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639776540 |
Product | nifR3 family TIM-barrel protein |
Protein accession | YP_923139 |
Protein GI | 119716174 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.439583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTCGC CCACCTCACC GACCGGCAGT CCGGCCGGGG GGCTGGCCCT GGGGTCGCTG CGCGTCGACA CCCCGGTGGT GCTCGCGCCG ATGGCCGGCA TCACCAACGC GGCCTACCGC CGGCTGTGTG CCGAGCAGGG GGCCGGCCTC TACGTCTGCG AGATGATCAC CAGTCGGGGG CTGGTCGAGG GCGACCAGCA CACCAAGGAC ATGCTGGTCT TCGACGAGCT CGAGACGATC CGCTCCGTCC AGCTCTACGG CAGCGACCCG GCCTACGTCG GCAAGGCCGC CGAGATCCTC TGCGCGGAGT ACGGCGTCGC GCACATCGAC CTCAACTTCG GCTGCCCGGT GCCCAAGGTG ACCCGCAAGG GCGGCGGCGG CGCGCTGCCG TGGAAGCGCG GGCTGCTCGC CGAGATCCTG GAGTCGGCCG TCGCCGCGGC CGCGCCGTAC GACGTGCCGG TCACGATGAA GACCCGCAAG GGCATCGACG AGGATCACCT CACCTACCTC GACGCCGGCC GGATCGCGCA GGAGTCCGGA TGTGCGGCGA TCGCGCTGCA CGGCCGGACG GTCGCGCAGG CCTACTCCGG CGCGGCCGAT TGGGACGCGA TCGCGGCGCT GGTCGAGCAC GTGGACATCC CGGTGCTCGG CAACGGCGAC GTCTGGGAGG CCGCGGACGC ACTGCGGATG GTCGAGGAGA CCGGCGTCGC GGGCGTCGTG GTCGGCCGCG GCTGCCTGGG CCGGCCCTGG CTCTTCCGCG ACCTCGCCGC CGCGTTCGGT GGCGAGGACG TCGCGACCCT GCCGGCCCTG GGCGAGGTGG CCGCGATGAT GCGCCGGCAC GCCGAGCTGC TGTGCCAGCA CCTGGGGGAG GAGCGCGGCT GCAAGGAGTT CCGCAAGCAC GTGACCTGGT ACCTCAAGGG TTTCGGCGCG GGCGGCGAGA TGCGGCGCTC GCTGGGCCTG GTCGACAGTC TCGCGGCACT CGACCGGCTG CTGGCCGAGC TCGACCCGGA CGAGCCGTTC CCGGAGCGCG AGCTGGGCGC CCCGCGCGGG CGCCAGGGAT CGCCGCGGGC CAAGGTCGCC CTGCCCGAGG GTTGGCTCGA GGACGCGGAC GGTCGCGGCC GGCACGTCCA GGAGGACGCG GACGAGACCA CCGGCGGGTG A
|
Protein sequence | MSSPTSPTGS PAGGLALGSL RVDTPVVLAP MAGITNAAYR RLCAEQGAGL YVCEMITSRG LVEGDQHTKD MLVFDELETI RSVQLYGSDP AYVGKAAEIL CAEYGVAHID LNFGCPVPKV TRKGGGGALP WKRGLLAEIL ESAVAAAAPY DVPVTMKTRK GIDEDHLTYL DAGRIAQESG CAAIALHGRT VAQAYSGAAD WDAIAALVEH VDIPVLGNGD VWEAADALRM VEETGVAGVV VGRGCLGRPW LFRDLAAAFG GEDVATLPAL GEVAAMMRRH AELLCQHLGE ERGCKEFRKH VTWYLKGFGA GGEMRRSLGL VDSLAALDRL LAELDPDEPF PERELGAPRG RQGSPRAKVA LPEGWLEDAD GRGRHVQEDA DETTGG
|
| |