Gene Mvan_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0149 
Symbol 
ID4647043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp161503 
End bp162525 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content67% 
IMG OID639803660 
ProductRNA polymerase factor sigma-70 
Protein accessionYP_951006 
Protein GI120401177 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02960] RNA polymerase sigma-70 factor, TIGR02960 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGC CGGTGCGTAA GCTGCATTTT GTGACCGTCA CGGCTCTTGA CGACAGCAGC 
GCCGAGGACG CCTTTCTCGC GGATGCACAG AGGTATCGGC GGGAGTTGCT GGCGCACTGC
TACCGGATGA CCGGTTCGCT GCACGATGCG GAAGATCTGG TCCAGGAAAC CTATCTACGC
GCCTGGAAGT CCTTCAAGGG GTTTCAGGGC AAATCCTCGG TGCGGACCTG GCTGTACCGC
ATCGCGACGA ACACCTGCCT GACGGCGCTG GACGGCAACA AGCGCCGGGC GCTGCCGAGC
GGGCTGGGCC AGCCGGCGTC CGACCCTGCC GGTGAGCTGT TCGTGCGTCC GGAGGTGACC
TGGCTGGAGC CGCTGCCCGA CGCCCCGCGC GAGGACCCGT CGGATCCGTC GGTGATCGCC
GAGTCCCGCG AGTCGGTCCG GCTGGCGTTC ATCGCCGCCC TGCAGCATCT CCCGCCGCGG
CAGCGCGCGG TGCTGGTGCT GCGCGAAGTG CTGCAGTGGA AGGCCGCGGA GGTCGGTGAG
GCGGTCGGCA CCTCGACCGC CGCGGTCAAC AGCCTGCTGC AGCGGGCCCG CGCCCAGCTC
GACGAGATCT CACCCAGCCG CGATGACGAG CCGGTCCCAC CGGAGTCGCC CGAGGCTGCG
GAGCTGCTGG ACAAGTACAT CGCCGCGTTC GAGGACTACG ACATGGACCG GCTGGTCGAG
CTGTTCACCG ACGACGCGGT GTGGGAGATG CCACCGTTCG ACGGCTGGTA CCAGGGTCCC
GCCAATATCG TCACGCTGTC GAAGGTGCAG TGCCCGGCAG AGAAAGCCGG CGACATGCGC
TTCCTCAGAA CCACCGCCAA CGGGCAACCT GTGGCCGCGC TCTACATGCG CAACCCGGAA
ACCGGTGTGC ACGAGGCATT TCAGCTGCAC GTGCTCGACG CGGGCAAGGC CGGAATCACA
CACGTGGTGG CGTTCAAGGA GAACGACCTG TTCGCCAGGT TCGGGCTGCC CGACACTCTC
TAA
 
Protein sequence
MSQPVRKLHF VTVTALDDSS AEDAFLADAQ RYRRELLAHC YRMTGSLHDA EDLVQETYLR 
AWKSFKGFQG KSSVRTWLYR IATNTCLTAL DGNKRRALPS GLGQPASDPA GELFVRPEVT
WLEPLPDAPR EDPSDPSVIA ESRESVRLAF IAALQHLPPR QRAVLVLREV LQWKAAEVGE
AVGTSTAAVN SLLQRARAQL DEISPSRDDE PVPPESPEAA ELLDKYIAAF EDYDMDRLVE
LFTDDAVWEM PPFDGWYQGP ANIVTLSKVQ CPAEKAGDMR FLRTTANGQP VAALYMRNPE
TGVHEAFQLH VLDAGKAGIT HVVAFKENDL FARFGLPDTL