Gene Noca_3875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3875 
Symbol 
ID4598010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4092849 
End bp4093865 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content67% 
IMG OID639778481 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_925060 
Protein GI119718095 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.372759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCATCG CACAGCGCCC CACCCTGTCG GAAGAGACCG TCGACGAGTT CCGCTCGCGG 
TTCGTGATCG AGCCCCTGGA GCCCGGCTTC GGCTACACGC TGGGCAACTC GCTCCGCCGT
ACCCTCCTCA GCTCGATCCC GGGTGCCTCG GTCACGAGCA TCAAGATCGA CAACGTCCTC
CACGAGTTCT CCACCATCGA GGGGGTCAAG GAGGACGTCA CGGAGGTCAT CCTCAACCTC
AAGGGTCTCG TCGTCTCCTC GGAGCACGAC GAGCCCGTCA CCATGTACCT GCGCAAGTCG
GGTGCCGGTG ACGTGACCGC CGCCGACATC GCGCCGCCGG CCGGTGTCGA GGTGCACAAC
CCCGACCTGA AGATCGCGAC CCTGTCCGAC AAGGGCAAGC TGGAGATGGA GCTGGTCGTC
GAGCGTGGCC GTGGCTACGT CTCCGCCGTC CAGAACAAGG GCGCCGACAA CGAGATCGGC
CGGATGCCGG TCGACTCGAT CTACAGCCCG GTCCTCAAGG TGACCTACAA GGTCGAGGCC
ACCCGTGTCG AGCAGCGCAC CGACTTCGAC AAGCTCGTCA TCGACGTCGA GACCAAGCCG
TCGATCCGGC CCCGCGACGC GATCGCGTCG GCCGGCAAGA CCCTGGTCGA GCTCTTCGGC
CTGGCCCGCG AGCTGAACGT CGAGGCCGAG GGCATCGACA TCGGCCCGTC GCCGGTCGAC
GAGCAGCTGG CCGCGGACCT CGCCCTCCCG GTCGAGGACC TGCAGTTGAC CGTCCGCTCC
TACAACTGCC TCAAGCGCGA GGGCATCCAC ACCGTGGGTG AGCTCATCAG CCGCTCGGAG
CAGGACCTGC TCGACATCCG CAACTTCGGT GCGAAGTCGA TCGACGAGGT CAAGGCCAAG
CTGGTCGAGA TGGGCCTGTC CCTCAAGGAC AGCGCGCCCG GCTTCGACCC GCACGCCGCG
CTCGCGGCGT ACGGCGATGA CGACGACGAC GCGTTCGTCG AAGACGAGCA GTACTGA
 
Protein sequence
MLIAQRPTLS EETVDEFRSR FVIEPLEPGF GYTLGNSLRR TLLSSIPGAS VTSIKIDNVL 
HEFSTIEGVK EDVTEVILNL KGLVVSSEHD EPVTMYLRKS GAGDVTAADI APPAGVEVHN
PDLKIATLSD KGKLEMELVV ERGRGYVSAV QNKGADNEIG RMPVDSIYSP VLKVTYKVEA
TRVEQRTDFD KLVIDVETKP SIRPRDAIAS AGKTLVELFG LARELNVEAE GIDIGPSPVD
EQLAADLALP VEDLQLTVRS YNCLKREGIH TVGELISRSE QDLLDIRNFG AKSIDEVKAK
LVEMGLSLKD SAPGFDPHAA LAAYGDDDDD AFVEDEQY