Gene Noca_2278 details

Gene Information

Locus tag: Noca_2278
Symbol:
ID: 4597824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2426863
End bp: 2428284
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 66%
IMG OID: 639776877
Product: sigma-70 region 4 domain-containing protein
Protein accession: YP_923470
Protein GI: 119716505
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID:
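
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive (the record does not say so, but the listed values are consistent with that) and that the final codon is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2426863, 2428284)
    print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths match the listed 1422 bp and 473 aa.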


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTCTCGC CTCGATCGGC GGATGACTCG GCCCAGGATG AGCGCAGCCG CGCGGGCGCG 
CCCAGCGCTG GCGGCTGTGC CGAGACGGTC GATGACCTGG GCGACGGGGA CCTCGGCGCT
CGCGGGTGGC TCGATGACGC TTGGGTGGCT CTCCCTGAAC GCAGCCGAGA CATCCTCCGT
CGGCGCCTCG CTGGGGAGAC ACTGGATGAG ATTGGACTGG CTCTGAACCT CACCCGCGAA
CGCGTCCGGC AGGTACAGAA GGCGTCTGAA GGCGCGCTCG TGCGCGCGAT GGAAAATCGG
GCCCCGCAGT TTCTCTCCGC CTTGCACACG GAACTCGGCG ACTCCCCGGC GGTGGCGCAT
CGTCACCTCG CGAAGCTTGT GGACGCTCAC TCCACGACCG CGCTGGGGTG TTTGCTCAAG
ACCTTGGGTG CCAGCCATCC CCGCACCTGG GCTGGGGCGC TCTCCGAGTT CTGGACCTTT
CGGCCGAACG AGCTGCGACA ACAATTGGGC CGAATGGTCG AGCTCGCACC CCTGACCCAC
GAGGAAGCTG ACCAAGCCGC CGCCGGGCTT GGCCTTCCCG AGGACCTCGA TTGGCGAAGC
GTTCTCGCGC ACAGAAACAG CAAACTGGCC GCCCACGATC TGGGTTGGAT TCGACCCGCG
CGACTCACCC GAGACGTGGC GTATCTCTGG CTCAAGCTCG AGGGCGAACC ACGAGCTGCG
GACGAGATCG CGGTACAGGC AGGATGCAGT GAACATGCCG CGCGGGAGAA CATGCGACGA
GACCCAGCCT TCTCACAAGT TCGACCAGAG GGCACTTGGG CGCTCTCCGA TTGGCGTGTC
CCCGGCTCAG AGAACCGCTA TGGCTCCGCC GTTGACGCCC TGGTTGAGGT TCTCCGAGAT
CTCGGGCCGC TGCCTGTTGA CCAACTCCGC GTTGAGACAC AGCGCCGTTA CCCCGTCAGT
GATTGGCGGG TGAACCAGTG CCTGTCGAGC AACCTCATCG GGCTTAACCC CGCTGGCCTG
TATGACCTGG CCGAACGGGG GGCTGTCCCC GTCGAGGACA CGGAACCCAA GCAGCCTCCG
AACATCAAGA CCAGTGGAGA CGTGGTGGGC ATCGAACTCG TAGTCGACCG CGAGATCCTT
CGCGGCAGCG GTATACCCGT GAATCGCTGG CTCACTTGGC AGCTGGGCCT CAGAACCGCT
CCGTCGACAA GATACTTTGC CCTCCCGGAG GGACACGGCG AGGTGCGCGT CACGCGGATG
ACCAGCAATG CTCAAGTCTC GAGTTTGCGT GCGGTGGCCG CCGAGTTCGG CCTCGTTGAG
GGCTGCAAGT TTGCGCTCCT GCTCAACACG AGCACGGACA CCGCCAGCGT CCGCCACATC
TGCCCGCAGG ACGCGTGTGG CGCGCGTAGC GCGACGCACT GA
 
Protein sequence
MFSPRSADDS AQDERSRAGA PSAGGCAETV DDLGDGDLGA RGWLDDAWVA LPERSRDILR 
RRLAGETLDE IGLALNLTRE RVRQVQKASE GALVRAMENR APQFLSALHT ELGDSPAVAH
RHLAKLVDAH STTALGCLLK TLGASHPRTW AGALSEFWTF RPNELRQQLG RMVELAPLTH
EEADQAAAGL GLPEDLDWRS VLAHRNSKLA AHDLGWIRPA RLTRDVAYLW LKLEGEPRAA
DEIAVQAGCS EHAARENMRR DPAFSQVRPE GTWALSDWRV PGSENRYGSA VDALVEVLRD
LGPLPVDQLR VETQRRYPVS DWRVNQCLSS NLIGLNPAGL YDLAERGAVP VEDTEPKQPP
NIKTSGDVVG IELVVDREIL RGSGIPVNRW LTWQLGLRTA PSTRYFALPE GHGEVRVTRM
TSNAQVSSLR AVAAEFGLVE GCKFALLLNT STDTASVRHI CPQDACGARS ATH
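
The sequences above are printed in blocks of ten characters separated by spaces, so they need cleaning before any computation. The following is a minimal sketch of how the listed gene sequence could be sanity-checked: it strips the whitespace, computes the GC fraction, and tests whether the sequence looks like a complete coding sequence. The start and stop codon sets are taken from NCBI translation table 11 as an assumption; the function names are hypothetical.

```python
# Codon sets assumed from NCBI translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def is_plausible_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and the ends are start/stop codons."""
    seq = clean_sequence(seq)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )
```

The gene sequence in this record begins with GTG and ends with TGA. GTG is an alternative start codon in table 11 and is read as methionine at initiation, which is consistent with the protein sequence starting with M.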