Gene Noca_2963 details

Gene Information

Locus tag: Noca_2963
Symbol:
ID: 4595747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3148288
End bp: 3149775
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 70%
IMG OID: 639777568
Product: RNA polymerase sigma factor
Protein accession: YP_924152
Protein GI: 119717187
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
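
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 3148288
end_bp = 3149775
gene_length_bp = 1488
protein_length_aa = 495


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon, for an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 3149775 - 3148288 + 1 gives 1488 bp, and 1488 / 3 - 1 gives 495 aa, which agrees with the listed values.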


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.743322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTCGTGT CCTCGAACGC GCGCAAGATG CTTCCTGCCG AGGTGCTCAC GCACCCTGCC 
GTCAAGGCCC TCATCGAGCG AGGTACGCCG ACCGGCAGCA TCACTCCGGA GGAGGTGCGC
CAGGCCAGTG AGGACGCGGC CGTCGAGCCC CGCCACCTCA AGGCGCTGCT CGGGCACCTG
AGCTCGCTGG GCATCGCGGT CGAGATCCCC GTGACCAGCC GCGCTGTCGC GGCCACCTCG
GCCCGCAAGA CCGCCACCGC CAAGACGGCG GCCGCCAAGA AGGCACCGGC GAAGGCGGCG
GCCGCCCCCG CGAAGTCCGC TCCCGCGAAG AAGGCGGCGC CGGCGAAGGC TGCGCCCCGC
AAGGCTGCCG CCGAGGGTGC TGCCGGCACC GAGTCGGCCG AGCAGACCGT GACGGTCGGC
CCGGACGGCA AGAAGGTGCT GCCCGACCTG CCCGACGAGC AGTTCGAGAA GGACGTGGCC
GCCGACCCGA GCATCGCCGA GGACGAGAAG CAGGCGTCGT TCGTCGTCTC CGCGGCCGAC
GACACCGACG AGCCCGAGCA GCAGGTCATG GTGGCCGGCG CCACCGCGGA CCCGGTCAAG
GACTACCTCA AGCAGATCGG CAAGGTCCCC CTGCTCAACG CCGAGATGGA GGTCGAGCTC
GCCAAGCGGA TCGAGGCCGG CCTGTTCTCC GAGGAGAAGC TCGGGAAGGG CGGCAAGCTC
TCGGCGAAGG TGTCCGAGGA GCTGGAGTGG ATCGCCGAGG ACGGCCGGCG CGCGAAGAAC
CACCTGCTCG AGGCGAACCT GCGGCTGGTC GTCTCCCTGG CGAAGCGCTA CACGGGGCGC
GGGATGCTGT TCCTGGACCT GATCCAGGAG GGCAACCTCG GCCTGATCCG CGCGGTCGAG
AAGTTCGACT ACACCAAGGG CTACAAGTTC TCGACCTACG CCACGTGGTG GATCCGCCAG
GCGATCACCC GCGCGATGGC CGACCAGGCC CGCACCATCC GGATCCCGGT GCACATGGTC
GAGGTCATCA ACAAGCTGGC CCGCGTGCAG CGCCAGATGC TCCAGGACCT GGGCCGCGAG
CCCACTCCGG AGGAGCTGGC CAAGGAGCTC GACATGACCC CCGAGAAGGT CATCGAGGTC
CAGAAGTACG GCCGCGAGCC GATCTCGTTG CACACCCCCC TCGGCGAGGA CGGTGACTCC
GAGTTCGGCG ACCTGATCGA GGACTCCGAG GCGATCGTCC CGGCCGACGC CGTGTCGTTC
ACGCTCCTCC AGGAGCAGCT GCACGCCGTC CTCGACACGC TCTCCGAGCG CGAGGCGGGC
GTGGTCAGCA TGCGCTTCGG CCTGACCGAC GGCCAGCCGA AGACCCTCGA CGAGATCGGC
AAGGTGTACG GCGTGACCCG GGAGCGGATC CGCCAGATCG AGTCGAAGAC CATGTCCAAG
CTGCGGCACC CGTCGCGCTC CCAGGTGCTG CGCGACTACC TGGACTGA
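
The GC content field can be recomputed from the gene sequence once the 10-base blocks are joined. This is a minimal sketch with hypothetical helper names; it is shown on the first row of the sequence only, so its result is the GC fraction of that 60-base excerpt and not of the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "GTGTTCGTGT CCTCGAACGC GCGCAAGATG CTTCCTGCCG AGGTGCTCAC GCACCCTGCC"
seq = clean_sequence(first_row)
print(len(seq), f"{gc_content(seq):.0%}")
```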
 
Protein sequence
MFVSSNARKM LPAEVLTHPA VKALIERGTP TGSITPEEVR QASEDAAVEP RHLKALLGHL 
SSLGIAVEIP VTSRAVAATS ARKTATAKTA AAKKAPAKAA AAPAKSAPAK KAAPAKAAPR
KAAAEGAAGT ESAEQTVTVG PDGKKVLPDL PDEQFEKDVA ADPSIAEDEK QASFVVSAAD
DTDEPEQQVM VAGATADPVK DYLKQIGKVP LLNAEMEVEL AKRIEAGLFS EEKLGKGGKL
SAKVSEELEW IAEDGRRAKN HLLEANLRLV VSLAKRYTGR GMLFLDLIQE GNLGLIRAVE
KFDYTKGYKF STYATWWIRQ AITRAMADQA RTIRIPVHMV EVINKLARVQ RQMLQDLGRE
PTPEELAKEL DMTPEKVIEV QKYGREPISL HTPLGEDGDS EFGDLIEDSE AIVPADAVSF
TLLQEQLHAV LDTLSEREAG VVSMRFGLTD GQPKTLDEIG KVYGVTRERI RQIESKTMSK
LRHPSRSQVL RDYLD
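
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own procedure: it uses the standard codon assignments (which translation table 11 shares) and always reads the first codon as methionine, which covers the GTG start codon of this gene. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments and differs in which codons may start a gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a complete or 5'-anchored CDS given as text with optional spaces."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein:
        # Simplifying assumption: the first codon is the initiator and is read
        # as Met even when it is an alternative start codon such as GTG.
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


# First row of the gene sequence; it should match the first 20 residues
# of the protein sequence above.
first_row = "GTGTTCGTGT CCTCGAACGC GCGCAAGATG CTTCCTGCCG AGGTGCTCAC GCACCCTGCC"
print(translate_cds(first_row))
```

Pasting all rows of the gene sequence into `translate_cds` allows the same comparison against the full 495-residue protein.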