Gene Mfla_2315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_2315 
Symbol 
ID4001410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp2470751 
End bp2471710 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content56% 
IMG OID637939241 
Productprolyl aminopeptidase 
Protein accessionYP_546423 
Protein GI91776667 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily
[TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000889953 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000338054 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCACG TATTATTTCC CGACATTCAA CCCTATCAGC AAGAGATGCT GCCCGTCTCA 
GACCTGCATG CTCTTTATTA TGAACAATCG GGCAATCCTG CCGGTCAACC GGTCATTTTC
CTGCACGGCG GCCCAGGAAG CGGTTGCAAT CCAGGGCAAC GCCGCTATTT CGACCCGGGC
CACTACCGCA TTATCCTAGT GGATCAGCGA GGTTGCGGAC GCAGCACGCC GCAAGGAGAA
ATCAGGGAAA ACACGACCAG CCATTTAGTG AATGATCTGG ACACGCTGCG CAAGCACCTG
GGCATTGATC GCTGGCTGGT GTTTGGCGGC TCATGGGGTA GTACGCTGGC GCTGAACTAC
GCTTTGGCAT ATCCGCAACA TGTCACAGGT CTCATTCTCC GCGGCATTTT CCTGAGTCGC
CCGAGCGAGC TGGAGTGGTT TTTGCATGAC GTGCAACACT TTTTTCCTGA GTCCTGGCAT
CGGCTGCTTT CCTACTTGCC TGTTGCTGAA CGGCATGACC CCTTGACTGC ATTCGCGGCA
CGCGTGTTTT CAGATGATCC TGCCGTCAAC GCACCGGCCG CCATCCACTG GAACGCATTC
GAGTCCAGCA TCATGACCTT GCTGCCAGTA ACCGCCACCA GCGAACAGGG CCTCAACCCC
GACATCGAGC TGGCACGAGC CCGCGTGCAA ATCCATTACA TCAAACACCA GTGCTTCCTC
GAGGGACGCA ACCTGATCGC GGAAGCCTCC GCCCAGCTGC GACATATACC TACCGTCATC
GTACAAGGCC GCTACGATAT GGTGTGTCCT CCATTGACAG CATATGAGCT TCACCAGGCC
ATGCCTCATG CAGAATTCCA CATAATTCCG GATGCCGGCC ACTCAGGCAT GGAAGCCGGC
ACCAGGAGCG CCCTGATTGC GGCTACGGAA AAATTCAAGC AAGCTCTGCA ATCAAGATAA
 
Protein sequence
MNHVLFPDIQ PYQQEMLPVS DLHALYYEQS GNPAGQPVIF LHGGPGSGCN PGQRRYFDPG 
HYRIILVDQR GCGRSTPQGE IRENTTSHLV NDLDTLRKHL GIDRWLVFGG SWGSTLALNY
ALAYPQHVTG LILRGIFLSR PSELEWFLHD VQHFFPESWH RLLSYLPVAE RHDPLTAFAA
RVFSDDPAVN APAAIHWNAF ESSIMTLLPV TATSEQGLNP DIELARARVQ IHYIKHQCFL
EGRNLIAEAS AQLRHIPTVI VQGRYDMVCP PLTAYELHQA MPHAEFHIIP DAGHSGMEAG
TRSALIAATE KFKQALQSR