Gene Mflv_3939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_3939 
Symbol 
ID4975254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp4198579 
End bp4200045 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content68% 
IMG OID640458166 
ProductRNA polymerase sigma factor 
Protein accessionYP_001135198 
Protein GI145224520 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGCGCCC CGACGACCCG CACGACCGAA AGGGTGTACG TGGCAGCGAC CAAGGCCAGC 
GCGGCGACCG ACGAGCCGGT GAAGCGCACC GCCGCCAAGG CCCCCGCGAA GAAGGCGCCC
GCGAAGAAGG CCACCAAGGC CACCAAGCCC CGCGCTGCGA AGTCCGCCGA CGCCCCGGCG
ACGCGCGGCC GCGCCAAGAA GGCCACCGCC GCCGAGCCGG GTGTCGTCGA CGACGAACTC
ACCGGCACCG AGGTCGACGC CACCGACGAC ATCGAGCCCG GCGAGGATCT CGTCGACGAC
GGTGACCTCG AGCTCGACGA CATCGAGGTC GAGGACGAGG CCACCGAGGA CGACGATTCC
GACGACGAGG CCGCCGAGGC CGATCCGGTG GTGGCCACCA CCAAGGCCGC CGCGAAGCCG
GCCAAGGAGG CCGACGACGA TTCGCTCCCC GAGCCGTCGG AGAAGGACAA GGCCTCCGGA
GACTTCGTCT GGGACGAAGA GGAGTCCGAG GCGCTGCGTC AGGCCCGCAA GGACGCCGAG
CTCACCGCAT CGGCCGACTC GGTTCGCGCC TACCTCAAGC AGATCGGCAA GGTCGCGCTC
CTCAACGCCG AGGAAGAGGT CGAGCTCGCC AAGCGCATCG AGGCCGGCCT GTACTGCACG
CAGCTGATGG CCGAGTTCGC CGAGAAGGGC GAGAAGCTCA CCACCGCGCA GCGGCGTGAC
TACATGTGGA TCTGCCGCGA CGGCGACCGC GCGAAAAATC ATCTGCTGGA AGCGAACCTG
CGCCTGGTGG TGTCGCTGGC CAAGCGCTAC ACCGGCCGCG GCATGGCGTT CCTGGACCTC
ATCCAGGAGG GCAACCTCGG CTTGATCCGC GCCGTGGAGA AGTTCGACTA CACCAAGGGT
TACAAGTTCT CGACCTACGC GACGTGGTGG ATCCGTCAGG CGATCACCCG CGCGATGGCC
GATCAGGCGC GCACCATCCG CATCCCGGTG CACATGGTCG AGGTCATCAA CAAGCTCGGC
CGCATCCAGC GCGAGCTCCT TCAGGACCTG GGTCGCGAGC CCACCCCCGA AGAGCTGGCC
AAGGAAATGG ACATCACGCC GGAGAAGGTG CTGGAGATCC AGCAGTACGC GCGTGAGCCG
ATCTCGCTGG ACCAGACCAT CGGCGACGAA GGCGACTCGC AGCTCGGCGA TTTCATCGAG
GACTCCGAGG CCGTCGTCGC CGTCGACGCC GTGTCCTTCA CGCTGCTGCA GGACCAGCTG
CAGTCGGTGC TGGAGACGTT GTCCGAGCGT GAGGCAGGCG TCGTCCGCCT GCGGTTCGGC
CTCACCGACG GCCAGCCGCG CACCCTCGAC GAGATCGGCC AGGTCTACGG CGTGACGCGC
GAGCGCATCC GTCAGATCGA GTCGAAGACG ATGTCGAAGT TGCGCCATCC CAGCCGGTCT
CAGGTGCTGC GCGACTACCT GGACTAG
 
Protein sequence
MCAPTTRTTE RVYVAATKAS AATDEPVKRT AAKAPAKKAP AKKATKATKP RAAKSADAPA 
TRGRAKKATA AEPGVVDDEL TGTEVDATDD IEPGEDLVDD GDLELDDIEV EDEATEDDDS
DDEAAEADPV VATTKAAAKP AKEADDDSLP EPSEKDKASG DFVWDEEESE ALRQARKDAE
LTASADSVRA YLKQIGKVAL LNAEEEVELA KRIEAGLYCT QLMAEFAEKG EKLTTAQRRD
YMWICRDGDR AKNHLLEANL RLVVSLAKRY TGRGMAFLDL IQEGNLGLIR AVEKFDYTKG
YKFSTYATWW IRQAITRAMA DQARTIRIPV HMVEVINKLG RIQRELLQDL GREPTPEELA
KEMDITPEKV LEIQQYAREP ISLDQTIGDE GDSQLGDFIE DSEAVVAVDA VSFTLLQDQL
QSVLETLSER EAGVVRLRFG LTDGQPRTLD EIGQVYGVTR ERIRQIESKT MSKLRHPSRS
QVLRDYLD