Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3586 |
Symbol | |
ID | 4599465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 3803742 |
End bp | 3805571 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639778194 |
Product | hypothetical protein |
Protein accession | YP_924773 |
Protein GI | 119717808 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0749071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGACC CCGGTCAGTT CGACGAGTTC TACAAGGACG TCCGCACGCG CCTGCTGCTG CTGACGTACT GCCTGACCGG CGACCTCGGA TCCTCCCGTG CCGCCGTCCG CGACGCGTTC GTGGTGGGCT CGCACCACTG GCGCAAGGTG ACCCGCCTGG AGGACCCGGA GGCCTGGGTC CGGGTGCGCG CCTGCGCACA CGCCCAGCGC CGCCACACCG CGAAGCTCTG GCACCGCGAG AAGGGCCTCG ACCCCGAGGT GAAGGCCACG CTGGACGCGC TCGGCAAGCT GTCGCTGGGC CAGCGCAAGG TGCTGCTGCT GACCGAGCTG ACGACCGCCT CGCTCGCCGA GATCGCTCGC GAGGTGGGGC TCCCCCGGCT GGAGGCCGAG CGCGAGCTGC AGACCGCGAT GTCGCGGCTC TCCGTCCTGC GCGAGGTGCC GACCACGAGC ATCCGCACGC TCTTCGACCC GATCCGCGCC CACGTCGACG ACGGCCGCTG GCCGCGAGCC ACGATCATCC GGCGGGCCGG GGCCGCCCGC CGTCGTACCC ACACGGTGAT CGGCGTCGCC GCGACCGTCG CCGCCCTGGT CGTCACCGGC ACGCTGGTCA CCGACGCGAC CGGGGTGCGG CCGACGCTGG CCGGTGAGCG GATGGAGGCC CCCCAGGACC ACAAGCCGTC GAGCTCGCCG ACCCCAGACC CGGTCGACGT TCCCGAGGAC ACGCTCCTCA CGGCCGAGCA GGTCGGCGCC CAGGTGCCCG GCTCGGGCTG GACGGTGACC CGGACCGACG ACAACTCCGG TGGCGACGGC TTGGTGATGC CGTGCCAGGC GAGCCGGTAC GCCGACCCCC GCGGCACCGC CGCCCTGGTT CGGGTCTTCG GATCCGCGGG CAAGGGCTCG GCCGAGACGG TCCAGGCCAC GCAGGCCTCC CCCTCGGCCA AGGCCGCGGG TCGCGGCTAC CGGACCGCGC TCTCCTGGTT CGCCGGCTGC GCCTCCGAGC GCGCCCAGCT GCTCGAGACC CGCGAGGTTG CGGGCGTCGG CGACGAGGCG ATGCTCGTGG TGCTGCGCAC CTGGGACGCT CCCGCCTCGA CCGTGGTCGC GGGCGTCGCC CGGACCGGGC AGCTCACGAC CACGGTCGTC AGCCGCTCGC CGGTCGGCCG GGCGCCGGAG CTGACGAAGT CCGCCGCCCT GCTCGGGTCG GCGGTGACCG AGCTGTGCGG CCGCACCGGC GCCGGGACCT GCTCGAGCCG GCCGCGCCTG CGGACCGTGC CGCCGGTCCC GGTCGCGACC GTGCCCGCCA TGCTCGCCGA GGTCGACCTG CCGCCGGTGG GAGAGGTACG CCGACCCTGG GTCGGCACCG AGCCGCGCCA GGCACGGGAC AACGCCGCGG CCACCGGCTG CGACCGCGCG GACTTCAGCA CCAAGGCGAT GAGCAACAAC GTCACCCGGA CGTTCCTGGT GCCCGGCGCG AAGCTGCGCG CCGAGTTCGG CCTCACCGAG ACGATCGGCT CCCTCCCCGA ACCGAAGGCG GCCGGCTTCG TCGACGACGT CCGCGACCGG CTGGCGAGCT GCTCGAAGAA GCAGATGGGC ACCCAGGTGG ACCGGATCCG CCAGCTCGAG GGCAAGCACC GCGACCTCAC CGTCTGGCGG GTGACCACGG AGATCAGCGA CCAGCGGTCG GTGAGCTACC TGATGGGCAT CGTCCGCGAC CGGACCTCCG TCGCGCAGGT CGGCTTCGTC CCCGACCCCG CCGGCGGGAT GTCCGCCGAC GACTTCGTCG CCCTGGTCGA GCGCGCGCTC GCGCGCCTCG AGGCGATGCC GCGCCCCTAG
|
Protein sequence | MRDPGQFDEF YKDVRTRLLL LTYCLTGDLG SSRAAVRDAF VVGSHHWRKV TRLEDPEAWV RVRACAHAQR RHTAKLWHRE KGLDPEVKAT LDALGKLSLG QRKVLLLTEL TTASLAEIAR EVGLPRLEAE RELQTAMSRL SVLREVPTTS IRTLFDPIRA HVDDGRWPRA TIIRRAGAAR RRTHTVIGVA ATVAALVVTG TLVTDATGVR PTLAGERMEA PQDHKPSSSP TPDPVDVPED TLLTAEQVGA QVPGSGWTVT RTDDNSGGDG LVMPCQASRY ADPRGTAALV RVFGSAGKGS AETVQATQAS PSAKAAGRGY RTALSWFAGC ASERAQLLET REVAGVGDEA MLVVLRTWDA PASTVVAGVA RTGQLTTTVV SRSPVGRAPE LTKSAALLGS AVTELCGRTG AGTCSSRPRL RTVPPVPVAT VPAMLAEVDL PPVGEVRRPW VGTEPRQARD NAAATGCDRA DFSTKAMSNN VTRTFLVPGA KLRAEFGLTE TIGSLPEPKA AGFVDDVRDR LASCSKKQMG TQVDRIRQLE GKHRDLTVWR VTTEISDQRS VSYLMGIVRD RTSVAQVGFV PDPAGGMSAD DFVALVERAL ARLEAMPRP
|
| |