Gene Noca_3586 details


Gene Information

Locus tag: Noca_3586
Symbol:
ID: 4599465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3803742
End bp: 3805571
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 74%
IMG OID: 639778194
Product: hypothetical protein
Protein accession: YP_924773
Protein GI: 119717808
COG category:
COG ID:
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
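
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3803742
end_bp = 3805571

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is not spliced and its last codon is a stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1830 609
```

Under these assumptions the computed values agree with the reported gene length of 1830 bp and protein length of 609 aa.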


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0749071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGACC CCGGTCAGTT CGACGAGTTC TACAAGGACG TCCGCACGCG CCTGCTGCTG 
CTGACGTACT GCCTGACCGG CGACCTCGGA TCCTCCCGTG CCGCCGTCCG CGACGCGTTC
GTGGTGGGCT CGCACCACTG GCGCAAGGTG ACCCGCCTGG AGGACCCGGA GGCCTGGGTC
CGGGTGCGCG CCTGCGCACA CGCCCAGCGC CGCCACACCG CGAAGCTCTG GCACCGCGAG
AAGGGCCTCG ACCCCGAGGT GAAGGCCACG CTGGACGCGC TCGGCAAGCT GTCGCTGGGC
CAGCGCAAGG TGCTGCTGCT GACCGAGCTG ACGACCGCCT CGCTCGCCGA GATCGCTCGC
GAGGTGGGGC TCCCCCGGCT GGAGGCCGAG CGCGAGCTGC AGACCGCGAT GTCGCGGCTC
TCCGTCCTGC GCGAGGTGCC GACCACGAGC ATCCGCACGC TCTTCGACCC GATCCGCGCC
CACGTCGACG ACGGCCGCTG GCCGCGAGCC ACGATCATCC GGCGGGCCGG GGCCGCCCGC
CGTCGTACCC ACACGGTGAT CGGCGTCGCC GCGACCGTCG CCGCCCTGGT CGTCACCGGC
ACGCTGGTCA CCGACGCGAC CGGGGTGCGG CCGACGCTGG CCGGTGAGCG GATGGAGGCC
CCCCAGGACC ACAAGCCGTC GAGCTCGCCG ACCCCAGACC CGGTCGACGT TCCCGAGGAC
ACGCTCCTCA CGGCCGAGCA GGTCGGCGCC CAGGTGCCCG GCTCGGGCTG GACGGTGACC
CGGACCGACG ACAACTCCGG TGGCGACGGC TTGGTGATGC CGTGCCAGGC GAGCCGGTAC
GCCGACCCCC GCGGCACCGC CGCCCTGGTT CGGGTCTTCG GATCCGCGGG CAAGGGCTCG
GCCGAGACGG TCCAGGCCAC GCAGGCCTCC CCCTCGGCCA AGGCCGCGGG TCGCGGCTAC
CGGACCGCGC TCTCCTGGTT CGCCGGCTGC GCCTCCGAGC GCGCCCAGCT GCTCGAGACC
CGCGAGGTTG CGGGCGTCGG CGACGAGGCG ATGCTCGTGG TGCTGCGCAC CTGGGACGCT
CCCGCCTCGA CCGTGGTCGC GGGCGTCGCC CGGACCGGGC AGCTCACGAC CACGGTCGTC
AGCCGCTCGC CGGTCGGCCG GGCGCCGGAG CTGACGAAGT CCGCCGCCCT GCTCGGGTCG
GCGGTGACCG AGCTGTGCGG CCGCACCGGC GCCGGGACCT GCTCGAGCCG GCCGCGCCTG
CGGACCGTGC CGCCGGTCCC GGTCGCGACC GTGCCCGCCA TGCTCGCCGA GGTCGACCTG
CCGCCGGTGG GAGAGGTACG CCGACCCTGG GTCGGCACCG AGCCGCGCCA GGCACGGGAC
AACGCCGCGG CCACCGGCTG CGACCGCGCG GACTTCAGCA CCAAGGCGAT GAGCAACAAC
GTCACCCGGA CGTTCCTGGT GCCCGGCGCG AAGCTGCGCG CCGAGTTCGG CCTCACCGAG
ACGATCGGCT CCCTCCCCGA ACCGAAGGCG GCCGGCTTCG TCGACGACGT CCGCGACCGG
CTGGCGAGCT GCTCGAAGAA GCAGATGGGC ACCCAGGTGG ACCGGATCCG CCAGCTCGAG
GGCAAGCACC GCGACCTCAC CGTCTGGCGG GTGACCACGG AGATCAGCGA CCAGCGGTCG
GTGAGCTACC TGATGGGCAT CGTCCGCGAC CGGACCTCCG TCGCGCAGGT CGGCTTCGTC
CCCGACCCCG CCGGCGGGAT GTCCGCCGAC GACTTCGTCG CCCTGGTCGA GCGCGCGCTC
GCGCGCCTCG AGGCGATGCC GCGCCCCTAG
 
Protein sequence
MRDPGQFDEF YKDVRTRLLL LTYCLTGDLG SSRAAVRDAF VVGSHHWRKV TRLEDPEAWV 
RVRACAHAQR RHTAKLWHRE KGLDPEVKAT LDALGKLSLG QRKVLLLTEL TTASLAEIAR
EVGLPRLEAE RELQTAMSRL SVLREVPTTS IRTLFDPIRA HVDDGRWPRA TIIRRAGAAR
RRTHTVIGVA ATVAALVVTG TLVTDATGVR PTLAGERMEA PQDHKPSSSP TPDPVDVPED
TLLTAEQVGA QVPGSGWTVT RTDDNSGGDG LVMPCQASRY ADPRGTAALV RVFGSAGKGS
AETVQATQAS PSAKAAGRGY RTALSWFAGC ASERAQLLET REVAGVGDEA MLVVLRTWDA
PASTVVAGVA RTGQLTTTVV SRSPVGRAPE LTKSAALLGS AVTELCGRTG AGTCSSRPRL
RTVPPVPVAT VPAMLAEVDL PPVGEVRRPW VGTEPRQARD NAAATGCDRA DFSTKAMSNN
VTRTFLVPGA KLRAEFGLTE TIGSLPEPKA AGFVDDVRDR LASCSKKQMG TQVDRIRQLE
GKHRDLTVWR VTTEISDQRS VSYLMGIVRD RTSVAQVGFV PDPAGGMSAD DFVALVERAL
ARLEAMPRP
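
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is one possible way to do that and to estimate GC content for a nucleotide string; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced here for illustration, not functions of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and return an upper-case string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C characters in a nucleotide string."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence above, copied verbatim.
first_line = "ATGAGGGACC CCGGTCAGTT CGACGAGTTC TACAAGGACG TCCGCACGCG CCTGCTGCTG "
bases = clean_sequence(first_line)
print(len(bases), round(gc_fraction(bases), 2))
```

Applying the same two helpers to the full gene sequence should give a value that can be compared with the GC content field in the record.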