Gene Mfla_2322 details


Gene Information

Locus tag: Mfla_2322
Symbol:
ID: 4001418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 2479188
End bp: 2481122
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 53%
IMG OID: 637939249
Product: sigma 70 (RpoD)
Protein accession: YP_546430
Protein GI: 91776674
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
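
The start, end, gene length and protein length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS is complete and ends in a stop codon; the function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2479188
end_bp = 2481122


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1935
print(protein_length(length_bp))  # 644
```

Both values agree with the Gene Length and Protein Length fields listed in the record.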


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000648058
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00289182
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGGTCAA GCATGGCTAA AGAGCAACAG AATGACAATG TAAACGATAA AGCCGATATT 
ACACTGAATG ACGCAGATAT ACGTCGCACG CGGTTAAAGT CCCTGATTAT CCTGGGTAAG
GAGCGCGGAT ACCTCACTTA CGCAGAAATT AACGATCACC TGCCGGATGA TGTACAGGAT
TCCGAGCAGA TCGACAGCAT CATCGGCATG ATCAACGACA TGGGCATCCA GGTCTATGAA
GAGGCGCCCG ACGCCGAAGC ATTGCTGATG TCCGATGTAC CAGCACCAGT TGCTGACGAG
GATGCCGTTG AGGAAGCCGA GCAGGCCCTT TCTACGGTGG ATTCCGAGTT TGGCCGCACC
ACTGACCCGG TGCGCATGTA CATGCGGGAA ATGGGTACGG TCGATCTGTT GACCCGTGAG
GGCGAAATCG AGATTGCCAA GCGGATAGAA GACGGCCTCA AGCATATGGT GCAAGCTATT
GCCGCCTGTC CCAGTACCAT TGCCGAATTA TTGGCCATGG TGGATCAGGT TGAGAAGGGC
GAGTTGGGGG TCGATGATCT GGTCGATGGC CTGATTGATG CTGACGATGG TCTTGGTGCC
GAAATCGTGG CTGAAGAAGC TGCTGATGCT GGAGATGTGG CTGAAGACGC CGACGAGGAG
GACGAAGAAG ACGATGATGG TGCCAAGGCG GCAGCGATCT CGGCAGAAGC ACTTGCAAGG
CTTCGCGAGG AAGTGCTGAC GCGATTTGCC GTCATCCGTA GCGCTCATGA GAAGATGACC
GTGCTTTTGG CAAGCAAGGG TTCTCAGGAC AAAGAGTATC TCAAGCTGCA GGCTACGATT
ATTGAAGAGC TGATGGCATT CCGCTTTTCA GCCAAGCAGG TTGAGGCATT GTGTGACCGT
GTGCGTGGCA TGGTAGAAGA GATCCGTAGC CATGAACGTA AAATCATGGA CTTCTGCGTC
GAAAAGGCCG GTATGCCGAG GGCGCATTTC ATTAAGAGCT TCCCGGGCAA CGAGGGCAAT
CCAGAGTGGT TGGCTGCGGA ACTTGCCGTC AACAAACCGT ATGTTGAGCG TCTTGAGCGA
TTCAGGCACT CGATCGATGA CCAGCAGCAA CAATTACTTG CCATTCAAGA GCGTGTGGGT
ATTCCCATCA GGGAGCTCAA GGAAATCAAC AAGCAGATGT CCACAGGCGA GGCACGGGCG
CGTCGTGCCA AGCGCGAGAT GATCGAGGCC AACTTGCGTC TCGTGATTTC TATCGCCAAG
AAATACACCA ACCGTGGCCT GCAATTCCTT GACCTGATTC AGGAAGGCAA TATCGGTTTG
ATGAAGGCGG TAGACAAGTT CGAATACCGT CGTGGTTACA AGTTCTCTAC ATATGCCACA
TGGTGGATTC GCCAGGCAAT CACACGCTCG ATCGCAGATC AGGCGCGTAC TATTCGCATT
CCTGTGCACA TGATCGAGAC GATCAACAAG ATGAATCGCA TCAGCCGCCA GATCCTGCAG
GAAACCGGCT TGGAGCCAGA TCCAGCCACC CTTGCGGAAA AAATGGAGAT GCCGGAAGAT
AAGATCCGCA AAATCCTCAA GATTTCCAAG GAGCCTATCT CCATGGAAAC CCCAATTGGA
GATGACGACG ATTCCCACTT GGGTGATTTC ATCGAGGATG TCGCAACACT TGCTCCCGTG
GATGCCGCCG TATACGCAAG CTTGCGTGAT GCCACCAAAG AAGTGTTAGA ATCGCTGACC
GCAAGGGAGG CCAAGGTGTT GCGCATGCGG TTCGGTATTG AGATGAATAC CGACCACACC
CTGGAAGAGG TTGGCAAACA ATTCGATGTC ACCCGCGAAC GTATCCGTCA AATTGAGGCC
AAGGCGCTAC GCAAACTGCG CCATCCAACC CGCTCAGAAC GCCTGCGCAG TTTCCTCGAA
ACTGGCAACG ACTAA
 
Protein sequence
MGSSMAKEQQ NDNVNDKADI TLNDADIRRT RLKSLIILGK ERGYLTYAEI NDHLPDDVQD 
SEQIDSIIGM INDMGIQVYE EAPDAEALLM SDVPAPVADE DAVEEAEQAL STVDSEFGRT
TDPVRMYMRE MGTVDLLTRE GEIEIAKRIE DGLKHMVQAI AACPSTIAEL LAMVDQVEKG
ELGVDDLVDG LIDADDGLGA EIVAEEAADA GDVAEDADEE DEEDDDGAKA AAISAEALAR
LREEVLTRFA VIRSAHEKMT VLLASKGSQD KEYLKLQATI IEELMAFRFS AKQVEALCDR
VRGMVEEIRS HERKIMDFCV EKAGMPRAHF IKSFPGNEGN PEWLAAELAV NKPYVERLER
FRHSIDDQQQ QLLAIQERVG IPIRELKEIN KQMSTGEARA RRAKREMIEA NLRLVISIAK
KYTNRGLQFL DLIQEGNIGL MKAVDKFEYR RGYKFSTYAT WWIRQAITRS IADQARTIRI
PVHMIETINK MNRISRQILQ ETGLEPDPAT LAEKMEMPED KIRKILKISK EPISMETPIG
DDDDSHLGDF IEDVATLAPV DAAVYASLRD ATKEVLESLT AREAKVLRMR FGIEMNTDHT
LEEVGKQFDV TRERIRQIEA KALRKLRHPT RSERLRSFLE TGND
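
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene sequence. It assumes the codon assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons, which is not modeled here). The helper names `clean`, `gc_percent` and `translate` are hypothetical and used only for illustration.

```python
# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code; alternative start codons are not handled.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGGGTCAA GCATGGCTAA AGAGCAACAG"
print(translate(first_blocks))  # MGSSMAKEQQ
```

Pasting the full gene sequence into `translate` and `gc_percent` allows a comparison with the listed protein sequence and the GC content field.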