Gene Mfla_0859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_0859 
Symbol 
ID4000428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp899620 
End bp901053 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content54% 
IMG OID637937759 
Productpeptidase S1C, Do 
Protein accessionYP_544968 
Protein GI91775212 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.078917 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0183856 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAA AGCTAATTGC CATGTCAGCA ATTTGTTTAT TTGTTGGTAT GGCGGGGGCA 
ACGCCGGTTT TAGCCAAGGA ATTGCCCGAT TTTACCGAGC TGGCGGAAAA GCAGGGAGCG
GCAGTGGTCA ATATCAGCGT GACCCAGGTC GTACAGTCTG GAATAGGTGG ATCTCCTTTT
CCCGGATTCC CCGAAGATGA GGCATTGAAT GAATTCTTTC GCCGTTTTGG CATTCCAGGG
TTTCCGGGTG TGCCGCGCGG ACAAGGTGGT CCACAGCAAC CTGAATTTAA ATCCCAGTCC
CTCGGGTCAG GATTCATCAT TAGCAGCGAT GGTTATATCC TGACGAATGC CCATGTAGTT
CGCGAAGCCG ATGAAGTGAT CGTCAAGCTG AATGATAAAC GTGAATTTCA GGCCAAGATT
GTGGGGGTTG ACCGCCGCAC GGATGTCGCG CTGCTTAAAA TTGATGCGAC AGGGCTGCCG
AAGGTCACCA TTGGCAATCC TGAGCAACTG AAGGTAGGGG AGTGGGTGGT GGCAATTGGC
TCCCCGTTTG GACTGGAAAG TACGTTGACC GCCGGTGTGG TCAGTGCAAA AGGCCGTGCC
TTGCCACAGG AAAATTTTGT GCCTTTCATC CAGACCGATG TTGCCATTAA CCCTGGCAAT
TCTGGCGGAC CGTTATTCAA CCTCAAGGGT GAGGTGGTAG GCATTAACTC CCAGATATAC
AGCCGAACTG GCGGTTATAT GGGGTTATCG TTCGCCATTC CGATTGATGT GGCCATGGAT
GTTGCCAATC AGCTCAAGAT TTCCGGTCGC GTAGCGCGTG GCTGGCTTGG GATCGGTATT
CAGGAAATGA CCAAGGAGCT GGCTGAGTCG TTTGGTATGA AGAATACCAA AGGGGCTTTG
GTCGCCGGCG TGGAAAAAGG CAGTCCTGCT GAAAAGGGCG GCCTGGAGCC AGGTGATGTC
GTAATCAAGT TCGATGGCAA GGATGTCAAT GTTTCTTCCG ATTTGCCGCG TATCGTTGGT
TCCACCAAGC CTGGCAAGAA GGTGCAGGTC GAAGTCTTGC GCAGGGGGGC TAGCAAGACC
TTGAATATTA CACTGGGTGA AATGCCGGCC GACAAGGATG AGGTTGTGCC AACTGCGCAG
CCCGATGCCA AGCCAGAGTC CAATCGCCTG GGGTTGACCC TACGCGAGTT GACGCCACAG
CAGCGTCGTA GCCTCAATGG TCGCAATGCG CTGGTCGTGG TTGATGCGCA AGGTGCTGCT
GCACAGGCAG GCATCCGCAG GGGAGATCTG ATCCTAGCCC TGAACAATAC GGAGGTGCAA
AGCCTGGAGC AGTTCACCAA GCAGGTAAAT GCGGTGCCTG CGGGTAAGAC AGTGGCGTTG
CTCGTGCAGC GGGAAAACAA TACCCTGTAC GTACCAGTCA AGGTTGGCAA GTAA
 
Protein sequence
MFKKLIAMSA ICLFVGMAGA TPVLAKELPD FTELAEKQGA AVVNISVTQV VQSGIGGSPF 
PGFPEDEALN EFFRRFGIPG FPGVPRGQGG PQQPEFKSQS LGSGFIISSD GYILTNAHVV
READEVIVKL NDKREFQAKI VGVDRRTDVA LLKIDATGLP KVTIGNPEQL KVGEWVVAIG
SPFGLESTLT AGVVSAKGRA LPQENFVPFI QTDVAINPGN SGGPLFNLKG EVVGINSQIY
SRTGGYMGLS FAIPIDVAMD VANQLKISGR VARGWLGIGI QEMTKELAES FGMKNTKGAL
VAGVEKGSPA EKGGLEPGDV VIKFDGKDVN VSSDLPRIVG STKPGKKVQV EVLRRGASKT
LNITLGEMPA DKDEVVPTAQ PDAKPESNRL GLTLRELTPQ QRRSLNGRNA LVVVDAQGAA
AQAGIRRGDL ILALNNTEVQ SLEQFTKQVN AVPAGKTVAL LVQRENNTLY VPVKVGK