Gene Noca_3172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3172 
Symbol 
ID4600157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3370302 
End bp3371180 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content72% 
IMG OID639777778 
Productsigma-70 region 2 domain-containing protein 
Protein accessionYP_924361 
Protein GI119717396 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02957] RNA polymerase sigma-70 factor, TIGR02957 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGG ACCCCTTCGT CGCTCACCGC AGCCTGCTGT TCACCGTCGC CTACGAGATG 
CTCGGGTCGG TCGCCGACGC CGAGGACGTG GTGCAGGAGA CCTGGCTGCG CTGGGCAGCC
CTGCCCGCCG CCGACCGTGG TGAGGTCCGA GATCCCCGTG CCTACCTCGT GCGGATCGTC
ACCCGGCTCT CCCTCAACCG GCTGCGTACG CTCACCCGGC TGCGGGAGGA GTACGTCGGC
GAGTGGCTCC CCGAGCCGCT GCTCACCAGC CCCGACGTCG CCGAGGACGT CGAGCTCGCG
GAGAGCGTGT CGATCGCCAT GCTCGCGGTC CTCGAGACAC TGCTGCCGAC CGAGCGTGCG
GTCTTCGTGC TCCGGGAGGT CTTCGACGTG CCCTACGACG AGATCGCGGC GGCGTTGGAC
AAGTCCTCCG CTGCGGTGCG CCAGATCGCG TCCCGGGCCC GCAAGTACGT GGCGGCCCGC
CGGCCCCGGA CCTCGGTGAG CCGTGCGGAG CAGGAGCGGG TGGTCGAGCG GTTCCTCGCC
GCCCTGACGA CCGGCGACGT CGTGGGACTG CTCGACGTGC TCGCTCCGGA CGTGCTCCTC
GTGGGCGACG GCGGCGGCCT GGTCCCGACC GTCCCGAGTC CCGTGCGCGG GGCGGCCCGG
CTCGCCCCGG TGATGGCCCG CTTCGCCGAG CTCGCGCCCG GCACGACGGC CGTCATCGTC
GACCTCAACG GCGGCATCGC GGCGCGCATC GATCCCGGCG GCCAGAACGA CACGGCCGTC
TCGTTCGTCA TCGAGGGCCA CCGGATCGCG CAGATCTACG CGATCCGCAA CCCCCACAAG
CTCCAGCGCC TGGCAGAGGT GGCCGAGCTC CGACGGTGA
 
Protein sequence
MSEDPFVAHR SLLFTVAYEM LGSVADAEDV VQETWLRWAA LPAADRGEVR DPRAYLVRIV 
TRLSLNRLRT LTRLREEYVG EWLPEPLLTS PDVAEDVELA ESVSIAMLAV LETLLPTERA
VFVLREVFDV PYDEIAAALD KSSAAVRQIA SRARKYVAAR RPRTSVSRAE QERVVERFLA
ALTTGDVVGL LDVLAPDVLL VGDGGGLVPT VPSPVRGAAR LAPVMARFAE LAPGTTAVIV
DLNGGIAARI DPGGQNDTAV SFVIEGHRIA QIYAIRNPHK LQRLAEVAEL RR