Gene M446_4533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4533 
Symbol 
ID6129445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4988676 
End bp4989722 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content75% 
IMG OID641644673 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001771308 
Protein GI170742653 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.678765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.937332 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTCC TCGGCATCGA GACCACCTGC GACGAGACCG CCGCCGCGAT CGTCACCGCG 
GCGGAGGACG GGCGCGCGGT GATCCGCGCC AACGAGGTGC TGAGCCAGAT CGCCGAGCAC
GCGGCCTATG GCGGCGTGGT GCCCGAGATC GCCGCCCGCG CCCACGTGGA GGTGGTCGAC
CGGCTGATCG CCCGGGCGCT CCAGGAGGCG GGCCTGGGCT TCGACGACCT CGACGGCATC
GCGGTCGCGG CCGGCCCGGG GCTGATCGGC GGCGTGCTCG TCGGCCTCGT CACCGCCAAG
ACCCTCTCGC TCGTCACGCG CAAGCCGCTC CTGGCGGTCA ACCACCTGGA GGCGCATGCC
CTGACCGCCC GGATGACGGA CGGGATCGCC TTTCCGTACC TCCTGCTGCT CGCCTCGGGC
GGTCACACCC AGCTCGTGGC CGTCAAGGGC GTGGGCGAGT ATGTCCGCCT CGGCACCACG
ATCGACGACG CGATCGGCGA GGCCTTCGAC AAGGCGGCCA AGCTCCTCGG CCTCGCCTAT
CCGGGCGGGC CCGAGGTCGA GCGGGCCGCC GAGGGCGGCG ATCCGGAGCG CTTCGCCCTG
CCCCGCCCGA TGCTCGGCCG GCGCGAGCCG AACTTCTCCC TCTCGGGCCT CAAGACCGCC
CTGCGGATCG AGGCGGAGCG CATCGCCCCC CTGTCCGGCC AGGATGTCGC CGATCTCTGC
GCCAGCTTCC AGGCGGCGGT GGTGGACGTG GTCGTCGACC GCGTCCGGGT GGCCCTGCGC
GCCTTCGGGG ACGTTGCGGG CCACCCGACC GCCCTGGTGG CGGCGGGCGG CGTCGCCGCC
AACGCGGCCC TGCGGCGGGC GCTCAGCCAG CTCGCGGGCG AGGCCGGGCT GCCCCTGGTC
GCCCCGCCCC TGCCGCTCTG CGGCGACAAC GGCGCGATGA TCGCCTGGGC GGGCCTGGAG
CGCCTGCGGC TCGGCCTCGT CGACGACATC ACGGCGCCGG CCCGCCCGCG CTGGCCCTTC
GCCGAACCCC TCGCGACGGC CGGGTGA
 
Protein sequence
MNVLGIETTC DETAAAIVTA AEDGRAVIRA NEVLSQIAEH AAYGGVVPEI AARAHVEVVD 
RLIARALQEA GLGFDDLDGI AVAAGPGLIG GVLVGLVTAK TLSLVTRKPL LAVNHLEAHA
LTARMTDGIA FPYLLLLASG GHTQLVAVKG VGEYVRLGTT IDDAIGEAFD KAAKLLGLAY
PGGPEVERAA EGGDPERFAL PRPMLGRREP NFSLSGLKTA LRIEAERIAP LSGQDVADLC
ASFQAAVVDV VVDRVRVALR AFGDVAGHPT ALVAAGGVAA NAALRRALSQ LAGEAGLPLV
APPLPLCGDN GAMIAWAGLE RLRLGLVDDI TAPARPRWPF AEPLATAG